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ABSTRACT 


Technology mapping is an effective appraiser for any technology. It assists in 
research and development of the firm by providing information for decision maker in 
regards of technical progress and current market trends. Patent documents are one of 
major source of technical and commercial knowledge. They are also valuable source 
to be used in technology mapping for measuring technology changes, trends and 
classifications of technology in various domains. I 

We propose an interactive methodology (semi-automated) for classification. Our 
methodology consists of using attributes like Assignee and Author name, I-Class and 
U-class number, Citation along with keywords and keyword-pair. We built an 
discriminant functions based on these attributes for classification. To demonstrate this 
methodology we have selected technology case of Supply Chain Management. Supply 
Chain Management is a very broad field and consists of many sub-technology 
domains. The complete classification of the patent documents in the field of supply 
chain management up to level 3 has been done using the proposed method. 
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CHAPTER 1 


INTRODUCTION 


Data Mining or Knowledge Discovery in Databases (KDD) as it is also 
known, is the nontrivial extraction of implicit, previously unknown, and potentially 
useful information from data. This encompasses a number of different technical 
approaches, such as clustering, data summarization, learning classification rules, 
finding dependency net works, analysing changes, and detecting anomalies. 


1.1 Patents and Technology Mapping (Verbeek A., et al., 2002) 

The relationship between science and technology and its role in economics 
development has received widespread attention. Publication and patents provides the 
primary ‘raw material’ for building and development system. Bibliometric 
information, i.e. scientific publication, patents and citations to these publication and 
patents, constitutes and adequate information source for the mapping of field and sub 
field of scientific and technological enquiry as well as a mean of assessing the 
performance of the major actors in those fields and sub fields. Scientific publications 
and patents constitute a generally accepted, though not always perfect, output 
indicator of scientific and technological activity. New ideas, on the one hand, and new 
R&D professionals, on the other hand, are the two major outputs of academic science. 
The latter are relatively easy to measure, whereas the output of new ideas is more 
difficult to grasp in objective, and preferably quantitative, way. The best measure that 
exist at the moment, consist of using scientific publications and patents and citations 
to these publication provides a measure of scientific productivity, while the number of 
patents produced by a particular organization, country or region provide a first order 
indicators of its technological vitality. The area of research utilizing tjie information 
contained in research publications to obtain a better view on an actor’s scientific 
output is labeled ‘bibliometrics’. 
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The analysis of patent information is considered to be one of the most 
established, directly and history reliable methods of quantifying the output of a 
technology system (Soete and Wyatt, 1983). There use occurs more often in 
technology mapping compared to other innovation indicator. A patent hence is a 
‘property right’ on an ‘official sealed’ claim (Grupp, 1998). 

1.2 Supply chain management 

For the purpose of classification we have chosen Supply chain Management. It 
is very board field and it consists of many sub technologies. To study supply chain 
management and its sub technology domain Books, journals and Patents need to be 
referred. Patents are a rich source of information about technologies. IBM, 
i2 Technologies and Accenture LLP etc are the major players in this field and they 
have many patents in supply chain and related technology. 

1.3 Problem Identification 

Mirdha, Bhuvanesh (2004), used K-Nearest Neighbor (k-NN) algorithm 
method for classification of the patents in subclasses. This thesis is further to extend 
the methodology by developing discriminant function approach as classification tool. 

1.3.1 Methodology 

In this thesis we have used other indices like author name, assignee name, 
citation, key word, keyword-pairs, I-class, U-class as indicator for classification. 
These are associated with every patent that is present in USPTO and EPO. These 
indices are called metadata. We will use these indices to develop classification 
method for patents with objective of maximizing precision and accuracy. 
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1 .4 Organisation of Thesis 


Technology Mapping is an important part of Research and development. It 
provides information about cunrent market trend and about technical progress in 
specific technology area. 

This thesis is organized as follows: Chapter 2 discusses the theoretical and 
research work that have been done in technology mapping, text mining area and the 
work done in field of document classification. Chapter 3 discusses the major issues 
and broad classification of supply chain management. Chapter 4 discusses step-by- 
step methodology followed for application of technology mapping using discriminant 
function approach. Chapter 5 discusses results obtained using discriminant function 
and in chapter 6 conclusion and future scope of the proposed work are discussed. 
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CHAPTER 2 


LITERATURE REVIEW 


No other innovation indicator can be traced back over comparatively long 
periods of time, may at the same time be disaggregated at a very low level allocable to 
individual economic units, and is also precise and accurate insofar as identification of 
the timing of the innovation event is concerned (Grupp, 1<?98). 

2.1 Patents 

A patent is a document, issued by an authorized governmental agency, 
granting the right to exclude anyone else from the production or use of a specific new 
device, apparatus, or process for a stated number of years. The grant is issued to the 
inventor of this device or process after an examination that focuses on both the 
novelty of the claimed item and its potential utility. The right embedded in the patent 
can be assigned by the inventor to somebody else, usually to his employer, a 
corporation and/or sold to or licensed for use by somebody else. Only the potential 
threat of or an actual suit in the courts for infringement damages can enforce this right 
(Griliches, 1990). 

For the claim to be recognized by other competing companies, all property 
right details have to be made public. The purpose of the patenting system is the 
protection of the inventor. Without property rights, technological knowledge would be 
public property (i.e. the public good character of knowledge) and competitors would 
be able to imitate inventions without penalty and claim any piece of new knowledge 
to be their own. The inventor is granted a temporary monopoly situation and, as a 
consequence, he is assured of the benefits by patent right that may derive from his 
innovative efforts. Via monopoly granted by the patent system, the incentive to 
innovate is deemed to be enticing enough for private inventors so that a sufficient 
number of innovative ‘efforts’ are made, which in turn favour technological advance 
and economic growth (Verbeek, A., et al., 2002). 
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2.2 Patent Document - Closer Look 


Generally a patent document has the following information. (Annexure A to 
USPTO, Annexure B to EPO) 

Patent Document 

• TITLE PAGE WITH BIBLIOGRAPHIC INFORMATION 

• TEXT 

DESCRIPTION OF THE INVENTION 
PREFERRED EXAMPLE IN DETAIL 
DRAWINGS/DIAGRAMS/FLOW SHEETS 

• CLAIM 


2.2.1 Title Page 

The title page containing bibliographic information is well indexed in most 
international databases dealing with patent documentation (e.g. Derwent). A quick 
glance at the title page indicates all the bibliographic details contained in it. This 
bibliographic information is summarized as follows: 


TYPICAL BIBLIOGRAPHIC INFORMATION 

• COUNTRY OF PUBLICATION 

• NATIONAL PATENT CLASSIFICATION 

• INT. PAT. CLASSIFICATION 

• TITLE 

• ABSTRACT 

• INVESTOR 
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• APPLICATION 

• APPLICATION NO. 

• SERIAL NO. OF PATENT 

• DATE OF APLLICATION 

• PRIORITY DATE 

• PRIORITY COUNTRY 

• REFRENCE CITED BY EXAMINER 


2.2.2 Text 

The various elements constituting the text of the patent document are 
illustrated below. 

TEXT 


• OBJECT OF INVENTION 

• PRIOR ART 

• EXAMPLES WITH SUPPORTING DATA OR OTHER SPECEFIC 
EMBODIMENTS (APPARATUS, FIGURE, ETC.) 

• DISCLOSE/DESCRIBE INVENTION ALSO INCLUDE 
DIAGRAMS/FORMULAE, IF NECESSARY 

• CLAIMS DEFINING THE MONOPOLY CLAIMED (THIS 
HELPS IN FINDING WHAT CONSTITUTES INFRINGEMENT) 

The “object of the invention” clearly defines the essential feature of the 
invention. The section on “Prior art” surveys the literature from both open literature 
and previous patent documents. After a brief description of various aspect of the 
invention, the specific embodiments, e.g. of various specific constructions or 
apparatus or chemical examples with supporting data, illustrating all aspect of an 
invention, follow. The “Claim” defining monopoly claimed is of great significance as 
these help in not only understanding all the features of the invention that are protected 
but also helps in finding what constitutes infringements. Drawings, circuit diagrams, 
Flow sheets, formulae, etc. are also used to quantify or even demonstrate the 
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invention. It will be clear that patent documents cover all aspects of an invention for 
which one seeks protection. Therefore, an appropriate study of this document can be a 
very important source of information and knowledge. It should be appreciated that a 
patent document is not necessarily (most often not) equivalent to ‘technology know- 
how’, however it is expected to carry relevant information so that a person well 
versed in the art of the subject can reproduce the invention (Ganguli, P. and 
Blackman, M., 1995). 

2.3 Patent and its role in innovation 

According to Grupp (1998), a patent has a number of specific properties or 
functions. First, a patent grants the owner the exclusive right of exploitation of a 
precisely defined technical advancement for a specific period of time. Three 
conditions need to be fulfilled for this grant to be made: novelty, inventive step and 
the possibility of commercial application. Secondly, the innovation-stimulating 
function of the patent system is supplemented by its information-dissemination 
function, which is achieved by the publication of the patent. The information function 
thus is a second qualitative property of patents. Patents can thus be used by 
technologists other than the inventors with the purpose of obtaining insight into the 
progress of technological knowledge. From an economic point of view, patents 
therefore have the advantage that the information contained in the patent is made 
publicly accessible, thus fostering the diffusion of technological innovations. The 
patent system fulfils an important role in the information diffusion in the sense that it 
avoids needless duplication of R&D efforts, which in turn is believed to accelerate 
technological progress. In return for obtaining a temporary monopoly, the inventor 
indeed has to make his invention public so that other inventors can contribute to yet 
further advances in the state-of-the-art in a particular field of technology. A third 
function relates to the output function created by patent documents. Successful R&D 
activity is usually followed by a patent providing detailed information (e.g. date, time, 
circumstances, location etc.) on the activity itself. This property can be used for the 
measurement of innovations. In relation to the property versus information functions 
of a patent, it has been debated to what extent the patent system also enhances social 
welfare. The system stimulates innovations and technological competition, but 
simultaneously it grants monopoly rights in order to keep the incentives for private 
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inventors as high as possible. This might disturb regular competition, albeit 
temporarily. Economists like to point to the fundamental dilemma between 
appropriation and diffusion of knowledge to foster economic progress (Arrow, 1994). 
The economic literature suggests various instruments that can be used by policy- 
makers to deal with this dilemma. In particular, both the length, i.e. the duration in 
time, and the width, i.e. the range of claims made and granted in the patent document, 
of the patent protection are shown to be instrumental in influencing the contribution 
of the patent system to the genesis of social welfare (Granstrand, 1 999). 

The relationship between patents, as a measure of innovative output, and 
R&D, as a measure of innovative input, has been investigated repeatedly. The results 
of these studies indicate a strong and statistically significant relationship between 
R&D expenditures and patent counts in the cross-sectional dimension, i.e. across 
firms and industries. Pakes and Griliches (1984), for example, found that firms that 
spend more on R&D possess more patents. In the within-firm time-series dimension, a 
statistically significant relationship between R&D and patent counts isl found too, but 
this relationship has appeared to be weaker. In a recent study, Arundel and Kabla 
(1998) showed that large R&D intensive firms do not patent a higher percentage of 
their innovations than firms with low R&D intensities. 

2.4 Filing a patent application (Verbeek, A., et al 2002) 

An inventor (individual, agency or company) wishing to protect an invention 
in a particular country files an application with that country’s patent office. In the 
application, one or more claims will be made, showing those aspects in which the 
product or process has to be considered inventive. During the examination procedure, 
these claims will be subject to further assessment and comparison with the 
technological frontier, the so-called prior art examination. It is possible to obtain 
protection in a (large) number of countries at the same time. Up to one year after the 
‘priority year’, the applicant can file for a patent in any desired country, at the 
European Patent Office (EPO) for simultaneous protection in more than one country, 
or at the World Intellectual Property Organization (WIPO) for an even broader 
protection. At present, the national systems in EU countries and the EPO system work 
in parallel, although for many years a Community Patent has been advocated in order 
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to reduce the inefficiencies and the costs associated with the current EPO procedures 
(patent applications now have to be translated into the various languages of the 
member countries in which they are applied for, making the procedure expensive and 
elaborate). At present, the political debate on the Community Patent has not yet been 
closed, although the economic arguments in its favour are obvious. An important way 
in which patent systems differ concerns their publishing and granting procedures. 
Until recently, a USPTO filing tended only to be published after the patent had been 
granted. These grants do not follow a strict timetable and can in many cases take up to 
five years (OECD 1994a, b). In the EPO system (and nowadays also in the USPTO 
system), a patent is disclosed 18 months after priority application, regardless of 
whether it has been granted or not. Given the historic problem of having to deal with 
different lead times between filing and publication, priority year dates have been more 
appropriate when making year-to-year comparisons between USPTO and EPO 
statistics than, for instance, the publication dates or the dates of the patent grant. 
Priority year dates are of course older than the publication dates. Differences in 
patenting procedures stem mainly from a different emphasis in patent philosophy. In 
the USPTO system, patent protection is focused on the protection of the rights of the 
inventor. The EPO system aims first of all at the timely diffusion of new 
technological information in order to stimulate the rate of technological progress. 

I 

i 

2.5 Indian patent law and literature (Biju, A. and Moitra S., 2001) 

Indian industry has had to face two particularly daunting challenges in the 
1990s. A protected market has given way to a liberalized environment, where it faces 
competition from imported products. Indian industry has also had to cope with rapid 
technological developments and innovation occurring in both product and process 
technologies. 

To come up with new products and processes, Indian industry needs to have 
access to detailed information on technological innovations that it has to compete 
with. While a search of patent literature and innovation surveys carried out in a 
competitor's parent country could give some idea of innovations taking place there, 
such information could be misleading because the competitor need not introduce that 
product in the Indian market. 


9 



A more reliable method of assessing competition could be an analysis of 
Indian patent data. Since competitors would normally apply for an Indian patent only 
if they intend to exploit an innovation in the Indian market, an analysis of Indian 
patent data could provide firms with information that could help in their strategic 
planning efforts. 

2.5.1 Indian patent office literature 

The patent law of 1970 restricted the fields of patentability, only grants 
process and not product patent in food, pharmaceutical and chemical fields, restricts 
the term of patents and has an elaborate system of licenses to ensure that patents are 
worked in India. After the globalization of the economies and increasing role of WTO 
norms, it became imperative for the countries to oblige these norms. One of the 
agreements relating to the patent policy was TRIPs (Trade Related Intellectual 
Property Rights). 

Salient features of the Patents (Amendment) Ordinance, 2004 (source: www. 
indianembassy.org /Economy) 

a) Extension of product patent protection to all fields of technology (i.e., drugs, food 
and chemicals); 

b) Deletion of the provisions relating to Exclusive Marketing Rights (EMRs) (which 

I 

would now become redundant), and introduction of a transitional provision for 
safeguarding EMRs already granted; 

c) Introduction of a provision for enabling grant of compulsory license for export of 
medicines to countries which have insufficient or no manufacturing capacity, to meet 
emergent public health situations (in accordance with the Doha Declaration on TRIPS 
and Public Health); 
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d) Modification in the provisions relating to opposition procedures with a view to 
streamlining the system by having both Pre-grant and Post-grant opposition in the 
Patent Office; 

e) Addition of a new proviso in respect of mailbox applications so that patent rights in 
respect of the mailbox shall be available only from the date of grant of patent, and not 
retrospectively from the date of publication. 

f) Strengthening the provisions relating to national security to guard against patenting 
abroad of dual use technologies; 

g) Clarification of the provisions relating to patenting of software related inventions 
when they have technical application to industry or are in combination with hardware; 

h) Rationalization of provisions relating to time-lines with a view to introducing 
flexibility and reducing the processing time for patent applications, and simplifying 
and rationalizing procedures. 

2.6 Patent Indicators (Verbeek, A., et al., 2002) 

As mentioned, if one wants to address questions about the technological 
sources of economic growth, the rate of technological change or the innovation 
capabilities of different firms and countries or regions, almost no good measures are 
available. As a consequence, we are reduced either to pure speculation or to the use of 
various, rather distantly related, ‘residual’ measures and proxies. 

2.6.1 Advantages of Patent Indicators as Measures of Technological 
Activity 

The various advantages of patent indicators as measurement of technological 
activity are as follows: 

o The proximity of patents to the output of industrial R&D and other inventive 
and innovative activities implies that there is no other or better equivalent for 
this measurement purpose. 
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o Patents cover virtually every field of technology useful for the analysis of the 
diffusion of key technologies (except software, which is generally protected 
by copyright and can be patented only when it is integrated as a ‘technical 
function’ in a process of product), 
o Patent data offer a world-wide geographical coverage. 

o The very detailed classification schemes in patent documents which allow for 
almost unlimited choice of aggregation levels from broad fields to single 
products. 

o Patent documents include many details of interest, such as year of invention, 
technical classification, assignee, inventor etc. 
o The statistical processing of the data is largely free of errors, because patent 
documents are legal documents in which the details are recorded carefully and 
systematically. 

o Accessibility, easy and large-scale electronic availability of patent data 
(EPO/USPTO data on CD-ROM or tape, DERWENT, Dialog, Datastar, 
Delphion etc.). 

2.6.2 Limitations of Patent Indicators as Measures of Technological 
Activity 

The limitations of patent indicators as measure of technological activity are 
listed below: 

o Firms differ in their propensities (number of patents per unit of expenditure on 
R&D or just number of patent applications) to patent, 
o Technology fields differ in their propensity to patent. 

o Countries differ in their propensity to patent: size and geographical position 
give rise different expectations on the returns from patent protection 
(combination with other input or output indicators are necessary), 
o Differences among the various (national) patent systems, arising from legal, 
geographical, economic and cultural factors (e.g. the issue of the ‘home 
advantage’) have to be taken into account when using patents as indicators of 
technological progress. 
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In spite of all the difficulties, patent statistics remain a unique resource for the 
analysis of the process of technical change. Nothing else even comes close in the 
quantity of available data, accessibility, and the potential industrial, organizational, 
and technological detail (Griliches 1990). 

2.7 Patent resources 

Many of the organizations are working in Intellectual property rights and 
patents area worldwide, some of them are international organizations and many are 
country wise patent offices which provides information and documents. 

a. World Trade Organization (WTO) http://www.wto.org 

b. World Intellectual Property Organization (WIPO) http://www.wipo.int 

http://www.ompi.int 

c. INPADOC http://www.cas.org, http://pk2id.delhi.nc.in 

d. European Patent Office (EPO) http://www.european-patentoffice.org 

e. European Union - Office of Harmonisation of Internal Markets (OHIM) 

http://www.oami.eu.int 

f. Singaporean patent office http://www.ipos.gov.sg, http://www.surfip.gov.sg 

g. Canadian intellectual property office http://patentsl.ic.gc.ca 

h. Japanese patent office IPDL http://www.jpo.go.jp 

i. US patent and trademark office (USPTO) http://www.uspto.gov 

j. Indian patent office http://www.patejitoffice.nic.in 

k. Chinese patent office http://www.sipo.gov.cn 


1. Australian patent office 


http ://www. ipaustralia. go v. au 



m. German patent office 


http://www.dpma.de 


n. Hungarian patent office http://www.hpo.hu 

o. Portuguese patent and trademark office http://www.inpi.pt 

p. Korean intellectual property office (KIPO) http://www.kipo.go.kr 

q. The UK patent office http://www.patent.gov.uk 


2.8 Patent analysis: a survey of literature 

Analysis of patent data has long been considered to be an important method of 
assessing various aspects of technological change. Most studies have used patent 
statistics as a tool for either studying the relationship between technological 
development and economic growth (Penrose, 1951, Taylor and Silberston, 1973), or 
to assess the research and innovation process in a national and international context ( 
Bosworth, 1984, Schiffel and Kitti, 1978, Pad and Sassu, 1997). Some studies, 
however, have analyzed it from the perspective of company policy for assessing the 
level of technology development in a particular sector, taking patent statistics as a 
technology indicator ( Archibugi and Pianta, 1996; Ashton and Ashton; Basberg, 
1987; Mogee, 1991; Liu and Shyu, 1997). 

Patent analysis has also served as a basis for analyzing a firm's policy with 
regard to research, development (Liu and Shyu, 1997), estimation of technological 
strengths and weaknesses of competitors ( Narin and Noma, 1987), and exploitation 
of foreign markets ( Shipman, 1967). Patenting activity in a foreign country is usually 
undertaken with the objective of protecting a potential market in that country for a. 
firm's products. Since patenting abroad can be a prolonged and expensive process, it 
seems reasonable to assume that a company patents abroad only when it is confident 
that a relatively large market exists for its products in the country where it patents. It 
has been recognized that the patenting activity by foreign firms in a country is closely 
related to the technological level and patenting system in force in the country 
( Bosworth, 1984). I 
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Findings derived from the use of patents as a proxy of inventive and 
innovative activity were used to analyze the interdependence between industrial 
sectors and technology fields in Austria (Gassier, 1996). Patterns of innovative 
activities at the technological and country levels using patents data have also been 
examined, and it was found that while these patterns differ systematically across 
technological classes, they are very similar across countries ( Malerba and Orsenigo, 
1996). As an alternative to patent counts, patent claims data have been explored and 
claims were found to be a better indicator of national technological capacity ( Tong 
and Frame, 1994). Finally, the issue of the private value of patents has been estimated 
over time and across technologies by using stochastic models ( Lanjouw, 1998; 
Fikkert and Luthria, 1998). The basic finding is that innovations become obsolete 
fairly rapidly (within about 10 years of the application date). 

Since patenting data provide information on both the technology levels in a 
particular sector and the commercial intentions of potential competitors, they have the 
potential to serve as a useful input into corporate strategic planning. 


2.9 Information Retrieval (Weng, 2003) 

Faster the Internet and information technologies are developed, information 
exchanges become quicker and the larger amount of information is exchanged. 
However, it leads to a problem regarding information overflow, which has become 
serious threat for retrieval and classification. This result more time and money being 
spend on information to find required kind of information. Often people repeat 
processes that they have already performed, such as filtering documents to retrieves 
similar information at different times. 

In the knowledge discovery area, data mining techniques are dedicated to 
Information Retrieval (IR) of structured databases. On the other hand, text-mining 
techniques are also dedicated to IR of unstructured or semi-structured textual data. 
Text mining can also be seen as a combination of techniques that include all 
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techniques applied in solving the problems about information overflow in documents 
(Weng, 2003). 

2.9.1 Data mining (Osmar R. Zaiane, 1999) 

We are in an age often referred to as the information age. In this information 
age, because we believe that information leads to power and success, and thanks to 
sophisticated technologies such as computers, satellites, etc., we have been collecting 
tremendous amounts of information. Initially, with the advent of computers and 
means for mass digital storage, we started collecting and storing all sorts of data, 
counting on the power of computers to help sort through this amalgam of information. 
Unfortunately, these massive collections of data stored on disparate structures very 
rapidly became overwhelming. This initial chaos has led to the creation of structured 
databases and database management systems (DBMS). The efficient database 
management systems have been very important assets for management of a large 
corpus of data and especially for effective and efficient retrieval of particular 
information from a large collection whenever needed. The proliferation of database 
management systems has also contributed to recent massive gathering of all sorts of 
information. Today, we have far more information than we can handle: from business 
transactions and scientific data, to satellite pictures, text reports and military 
intelligence. Information retrieval is simply not enough anymore for decision-making. 
Confronted with huge collections of data, we have now created new needs to help us 
make better managerial choices. These needs are automatic summarization of data, 
extraction of the "essence" of information stored, and the discovery of patterns in raw 
data. 


With the enormous amount of data stored in files, databases, and other 
repositories, it is increasingly important, if not necessary, to develop powerful means 
for analysis and perhaps interpretation of such data and for the extraction of 
interesting knowledge that could help in decision-making. 

Data Mining , also popularly known as Knowledge Discovery in Databases 
(KDD), refers to the nontrivial extraction of implicit, previously unknown and 
potentially useful information from data in databases. While data mining and 
knowledge discovery in databases (or KDD) are frequently treated as synonyms, data 
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mining is actually part of the knowledge discovery process. The following figure 
(Figure 2.1) shows data mining as a step in an iterative knowledge discovery process. 

2.9.2 Steps involve in data mining (Osmar R. Zaiane, 1999) 

The Knowledge Discovery in Databases process comprises of a few steps 
leading from raw data collections to some form of new knowledge. The iterative 
process consists of the following steps: 

• Data cleaning: also known as data cleansing, it is a phase in which noise data 
and irrelevant data are removed from the collection. 

• Data integration: at this stage, multiple data sources, often heterogeneous, 
may be combined in a common source. 

• Data selection: at this step, the data relevant to the analysis is decided on and 
retrieved from the data collection. 

• Data transformation: also known as data consolidation, it is a phase in which 
the selected data is transformed into forms appropriate for the mining 
procedure. 

• Data mining: it is the crucial step in which clever techniques are applied to 
extract patterns potentially useful. 

• Pattern evaluation: in this step, strictly interesting patterns representing 
knowledge are identified based on given measures. 

• Knowledge representation: is the final phase in which the discovered 
knowledge is visually represented to the user. This essential step uses 
visualization techniques to help users understand and interpret the data mining 
results. 

2.10 Information retrieval techniques 

When a patent application is considered or submitted, the search for previous 
inventions in the field-known as prior art-relies crucially on accurate patent 
classification. The retrieval of patent documents is crucial to patent-issuing 
authorities, potential inventors, research and development units, and others concerned 
with the application or development of technology. The number of patent applications 
is currently rising rapidly worldwide, creating the need for an automated 
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categorization system (Smith, 2002; Hull et al, 2001; Calvert & Makarov, 2001). In 
industry, patents are a major source for gathering intelligence abojut competitors' 
activities, but this source necessitates sophisticated tools for meaningful data mining 
(Vachon, 2001). 

2.10.1 Document categorization 

Traditionally, document categorization has been performed manually. 
However, as the number of documents explosively increased, the task became no 
longer amenable to the manual categorization, requiring a vast amount of time and 
cost. This has lead to numerous researches for automatic document classification. A 
text classifier assigns a document to appropriate categories, also called topic, in a 
predefined set of categories. Originally, research in text categorization addressed the 
binary problem, where a document is either relevant or not w.r.t. a given category. In 
real-world situation, however, the great variety of different sources and hence 
categories usually poses multi-class classification problem, where a document belongs 
to exactly one category selected from a predefined set. Even more general is the case 
of multi-label problem, where a document can be classified into more than one 
category. While binary and multi class problems were investigated extensively, 
multilevel problems have received much less attention. As the number of topics 
becomes larger, multi-class categorizers face the problem of complexity that may 
incur rapid increase of time and storage, and compromise the perspicuity of 
categorized subject domain. A common way to manage complexity is using a 
hierarchy. Internet directories and large online databases are often organized as 
hierarchies (Tikk, D. and Biro, G., 2003) 

Classifications are hierarchically organized collections of classes and 
subclasses that attempt to classify all subject matter. At the top of the hierarchy, 
classes represent very general subject matter. These general classes subdivide into 
more specific classes or subclasses. A class that subdivides into a subclass is a parent 
class while the subclass is defined as a child class. Subclasses having the same parent 
class are sibling subclasses. Subject matter can be classified in any class including a 
parent and child class of any hierarchical level (Legakis, L., et al., 1993). 
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Patent databases are typically such where the use of a hierarchical category 
system is a necessity. Patents cover a very wide area of topics, and each field can be 
further divided into subtopics, until a reasonable level of specialization is reached. 
(The hierarchy of topics is often called taxonomy.) The International Patent 
Classification (IPC) is a standard taxonomy developed and administered by WIPO 
(World Intellectual Property Organization) for classifying patents and patent 
applications (Fall, C .J., et al., 2002). 

Automated classification of documents has generated a lot of research interest 
over the last few years. The huge increase in the number of electronically available 
documents during this period has made intellectual classification and indexing 
increasingly difficult and costly. For organisations with an interest in the storage, 
handling and retrieval of documents, automated classification tools are often seen as a 
useful remedy against the explosion of costs arising from intellectual document 
indexing and classification (Richer, G. and MacFarlane, 2004). v 

Patents and patent applications are a typical example of this scenario. There 
was a six-fold increase in the number of PCT applications between 1990 and 2001. 
Not only patent offices but also commercial patent information providers are 
struggling to come to terms with the volume of information published in patents. For 
patent offices, the primary classification task is the association of International Patent 
Classification codes as well as national or European classifications. These 
classification systems are too fine to realistically achieve sufficiently high accuracy 
by automated classifiers so some efforts have focused on using them for the pre 
classification stage, where patent applications are associated to the appropriate 
technical unit for the examination phase. Texts of patents are widely and freely 
available on the World Wide Web, making patents ideal subjects for automated 
classification. Consequently, various publications have described attempts of 
automated patent classification over the last years. Most of these have originated from 
patent offices, possibly due to the confidentiality that is often applied by commercial 
organisations with regards to their research (Richer, G. and MacFarlane, 2004). 

A brief description of each method is given from a to q indexed below, “a” to 
“j” introduction and classification is given by Yang, Y. (1999). 
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a. CONSTRUE 


CONSTRUE is an expert system developed at the Carnegie Group, and the 
earliest system evaluated on the Reuter corpus (Hayes and Weinstein 1990). 
Impressive results (about 90% in both recall and precision, on average) were reported 
on a small subset (3%) of this corpus. A major difference between the CONSTRUE 
approach and the other methods considered is the use of manually developed domain 
specific or application specific rules in the expert system. Adapting Construe to other 
application domains would be costly and labor-intensive. 

b. Decision Tree 

Decision Tree (Dtree) is a well-known machine learning approach to 
automatic induction of classification, Dtree algorithms are used to select informative 
word based on an information gain criterion, and predict categories of each document 
according to the occurrence of word combinations in the document. Evaluation results 
of Dtree algorithms on the Reuters text categorization collection were reported 
by Lewis and Ringutte (Using the IND package) (Lewis and Ringutte 1994) and 
Moulinier (using C4.5)(Moulinier 1997), respectively. 

c. Naive Bayes Method 

Naive Bayes (NaiveBayes) probabilistic classifier is also commonly used in 
text categorization (Mitchell 1996). The basic idea is to use the joint probabilities of 
words and categories to estimate the probabilities of categories given a document. The 
naive part of such a model is the assumption of the word independence. The 
simplicity of this assumption makes the computation of the NaiveBayes classifier far 
more efficient than the exponential complexity of non-nai've Bayes approaches 
because it does not use word combinations as, Predictors. Evaluation results of 
NaiveBayes on Reuters were reported by Lewis and Ringuette (1994) and 
Moulinier(1997), respectively. 
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d. Inductive rule learning 


Inductive rule learning in Disjunctive Normal Form (DNF) was tested in 
theW ASP-1. RIPPER and CHARADE systems (Apte 1994, Moulinier 1994, Cohen 
and Singer 1996). DNF rules are of equal power to Dtrees in machine learning Theory 
(Mitchell 1996). Empirical Results for the comparison because DNF and Dtree 
approaches, however, are rarely available in text categorization, excepts in an indirect 
comparision by Apte et al.,1994. 

e. Neural network 

Neural network (N Net) approaches to text categorization were evaluated on 
Reuters by Wieners (1995) and Ng (1997), respectively. For convenience, the former 
system (development at Xerox PARC) is referred to as N Net. There is another system 
name CLASS! Both systems use a separate neural network per category, learning a 
non-linear mapping from input words (or more complex such as singular vectors of a 
documents space) to a category. The PARC group tried both a perceptron approach 
and the three-layered neural networks, however, are available only on a subset of the 
Reuters categories, which are common in evaluation of other systems. The CLASSI 
system only uses preceptrons. 

f. Rocchio method 

Rocchio is a classic vector-space-model method for document routing or 
filtering in information retrieval. Applying it to text categorization, the basic idea is to 
construct a prototype vector per category using a training set of documents. Given a 
category, the vectors of documents belonging to this category are given a positive 
weight, and the vector remaining documents are given a negative weight. By 
summing up these positively and negatively weighed vectors, the prototype vectors of 
this category is obtained this methods is easy to implement and efficient in 
computation and has been used as baseline in several evaluation (Lewis 1996, Cohen 
and Singer 1996). A positive weakness of this method is the assumption of one 
centeroid per category, and consequently, Rocchio does not perform well when the 
documents belonging to a category naturally from separate clusters. 
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g. Least Squares Fits 


LLSF stands for Linear Least Squares Fits, a mapping approaches developed 
by Yang (Yang and Chute 1992). A multivariate regression model is automatically 
learned from a training set documents and categories. The training data are 
represented in the form of input/out put pairs where the input vectors are documents 
in the conventional vector space model (consisting of word with weights). And the 
output vector consists of categories (with binary weights) of the corresponding 
document. By solving a linear least square fit on the training pair of vectors, one can 
obtain a matrix of word category regression coefficients. The matrix defines a 
mapping from an arbitrary document to a vector of weighted categories. By sorting 
these category weights, a ranked list of categories is obtained for the input document. 

h. Sleeping Experts 

Sleeping experts (EXPERTS) are on-line learning algorithm recently applied 
to text categorization (Cohen and Singer 1996). On-Line learning aims to reduce the 
computation complexity of the training phase for large applications. EXPERTS 
update the weights of n-gram phrases incrementally. 

i 

i. K-nearest neighbor classification 

KNN stands for K-nearest neighbor classification. Given an arbitrary input 
document, the system ranks its nearest neighbor among the training documents, and 
uses the categories of the k top ranking to predict the categories of the input 
document. The similarity score of each neighbor document to the new document 
being classified is used as the weight of each of its categories, and the sum of the 
category weight over the k nearest neighbors is used for category ranking. 

j. WORD 

WORD is a simple, non-learning algorithm, which ranks categories for a 
document based on word matching between the document and category names. The 
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purpose of testing such a simple method is to quantitatively measure how much an 
improvement is obtained by using statistical learning compared to a non-learning 
approach. The conventional vector space model is used for representing documents 
and category names (each name is treated as a bag of words), and the smart system 
(Salton 1989) is used as the search engine. 

k. Multi-classification ranking technique (Fall, C., et al., 2002) 

It made used of two automated categorization tools for a series of multi- 
classification ranking tasks: the "rainbow" package, part of the bow (bag-of-words) 
toolkit (McCallum 1996), and the "SNOW" (sparse network of winnows) learning 
architecture (Carlson 1999). The rainbow package implements multinomial Naive 
Bayes (NB), k-Nearest Neighbors (k-NN), and Support Vector Machine (SVM) 
algorithms. The SNoW package is tailored for learning in large feature spaces and 
implements a sparse network of linear functions where the class labels are represented 
as linear functions over a common feature space (Carlson, 1999). A variation of the 
winnow update rule is used for training. SNoW is run with word occurrence indexing 
(binary weighting), which was found to produce better results than word frequency 
indexing, the same collection of stop words as rainbow. 

Overall, it is found that support vector machine algorithm outperforms the 
Naive Bayes, k-NN, and SNoW algorithms under similar conditions, particularly for 
categorization at IPC subclass level. Flowever, because it is computationally 
expensive to train, it may be necessary to reduce the training complexjity, by limiting 
the number of training documents, by term selection, and/or by limiting the length of 
the documents. 

l. Classification Methodology (Legakis, L., 1993) 

Classification retrieval, is inherently a problem of determining the closeness of 
two attribute vectors. The first vector is the list of words supplied by the user. The 
second vector describes for each classification in the classification scheme a list of 
words derived from the patents in the respective classification. Each word of the 
second vector can have an associated numeric value that is a measure of the relevance 
of the word to each classification. A method of determining the relevance of a list of 
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words to a respective classification using machine extractable information inherent in 
the patents and the classifications was explored. 

m. Co-citation Analysis (Lai, K et ah, 2003) 

The focus of the co-citation analysis is on the documents cited, by calculating 
the frequency of A and B that are co-cited by specific documents. They assess the 
similarity of A and B based respectively on the number of co citing or co-cited 
documents. Stuart and Podoly (1996) applied the conception of corporate patent co- 
citation to enable firms to be positioned and grouped according to the similarities in 
their patents. There approach can be divided in to three phases. Phase I selects 
appropriate databases to conduct patent searches according to the subject and 
objective of this study and then select basic patents. Phase H uses the co-cited 
frequency of the basic patent pairs to assess their similarity. Phase EH uses factor 
analysis to establish a classification system and assess the efficiency of the proposed 
approach. The main contribution of this approach is to develop a patent classification 
system based on patent similarities to assist patent manager in understanding the basic 
patents for a specific industry, the relationships among categories of technologies and 
the evolution of a technology category. 

n. Fuzzy Relational Thesaurus (Tikk, D., et al., 2001) 

Text categorization is the classification to assign a text document to an 
appropriate category in a predefined set of categories. It presents a new approach for 
the text categorization by means of Fuzzy Relational Thesaurus (FRT). FRT is a 
multilevel category system that stores and maintains adaptive local dictionary for each 
category. The goal of their approach is twofold one, to develop a reliable text 
categorization method on a certain subject domain, and second to expand the initial 
FRT by automatically added terms, thereby obtaining an incrementally defined 
knowledge base of the domain. We implemented the categorization algorithm and 
compared it with some other hierarchical classifiers. Experimental results have been 
shown that our algorithm outperforms its rivals on all documents corpora investigated. 
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o. Metadata (Richer G., 2004) 


During the last decade, the advance of machine-learning tools and algorithms 
has resulted in tremendous progress in the automated classification of documents. 
However, many classifiers base their classification decisions solely on document text 
and ignore metadata (such as authors, publication date, and author affiliation).The k- 
Nearest Neighbor algorithm was developed for the classification of patents into two 
different classification systems. Those using metadata (in this case inventor names, 
applicant names and International Patent Classification codes) were compared with 
those ignoring it. The use of metadata could significantly improve the classification of 
patents with one classification system, improving classification accuracy from 70.8% 
up to 75.4%, which was highly statistically significant. However, the results for the 
other classification system were inconclusive: while metadata could improve the 
quality of the classifier for some experiments (recall increased from 66.0% to 68.9%, 
which was a small but nonetheless significant improvement), experiments with 
different parameters showed that it could also lead to a deterioration of quality (recall 
dropping as low as 61.0%). 

p. Probabilistic Rocchio Method (Joachim T., 1997) 

The Rocchio relevance feedback algorithm is one of the most popular and 
widely applied learning methods from information retrieval. Here, a probabilistic 
analysis of this algorithm is presented in a text categorization framework (Thorsten 
Joachims). The analysis gives theoretical insight into the heuristics used in the 
Rocchio algorithm, particularly the word weighting scheme and the similarity metric. 
It also suggests improvements, which lead to a probabilistic variant of the Rocchio 
classifier. The Rocchio classifier, its probabilistic variant, and a naive Bayes classifier 
are compared on six text categorization tasks. The results show that the probabilistic 
algorithms are preferable to the heuristic Rocchio classifier not only because they are 
better founded, but also because they achieve better performance. 
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q. Distributional Clustering (Bakerti, L. 1998) 


This describes the application of Distributional Clustering to document 
classification. This approach clusters words into groups based on the distribution of 
class labels associated with each word (L. Douglas Bakerti, Andrew Kachites 
McCallumlt 1998). Thus, unlike some other unsupervised dimensionality reduction 
techniques, such as Latent Semantic Indexing, we are able to compress the feature 
space much more aggressively, while still maintaining high document classification 
accuracy. Experimental results obtained on three real-world data sets show that we 
can reduce the feature dimensionality by three orders of magnitude and lose only 2% 
accuracy-significantly better than Latent Semantic Indexing, class-based clustering, 
feature selection by mutual information, or Markov-blanket-based feature selection. 
We also show that less aggressive clustering sometimes results in improved 
classification accuracy over classification without clustering. 



Knowledge 



Figure 2.1 Data Mining 


CHAPTER 3 


SUPPLY CHAIN MANAGEMENT 

A supply chain is a network of facilities and distribution options that performs 
the functions of procurement of materials, transformation of these materials into 
intermediate and finished products, and the distribution of these finished products to 
customers. Supply chains exist in both service and manufacturing organizations, 
although the complexity of the chain may vary greatly from industry to industry and 
firm to firm. 

Practices experienced in the traditional management of the supply chain raised 
the need for conversion to a new paradigm of supply chain management (SCM). The 
traditional supply chain and manufacturing processes relied on experience and 
intuition of managers and were designed with long supply cycle times, large batch 
sizes, capacity based on annual volumes, volume-driven technology, and numerous 
suppliers for the same parts on the short-term base contracts. With traditional 
management processes, the goal of business activities was to maximize the efficiency 
of an individual functional unit by achieving competitive edges based on cost 
reduction. Under the traditional supply chain, efforts of manufacturers to meet the 
increased changing of customer requirements caused decreased margins, poor service 
performance, increased overhead costs, poor production process reliability, increased 
downtime due to changeovers, and high inventory levels of raw materials and finished 
product. None of these conditions are viable in a competitive market. Most product 
supply systems are out of balance with customer requirements (Lummus, Vokurka, & 
Alber, 1998). Davis (1993) listed reasons why SCM needs renewed attention: reduced 
profit margins due to pressure from increasing competition, needs for administrating 
multisite manufacturing, cut-throat marketing channels, maturation of the world 
economy, customer service demands for quick and more reliable delivery, and ressure 
to reduce inventories. According to Cooper and Ellram (1993), SCM is designed to 
solve these problems and is important to reduce inventory investment in the chain, to 
increase customer service, and to help build a competitive advantage for the channel. 
With a changing management focus, companies also began to realize that 


28 




I 

i 

maximization of efficiency in one department or one functional unit is less desirable 
than optimal performance for the whole company. Needs for effective vertical 
integration and consumers' desire for a wider variety and complexity of products have 
led to demand for SCM (Lummus, Vokurka, & Alber, 1998). 

3.1 Definition 

Different people have stated supply chain in different terms. One of the 
definitions of supply chain, which is given by Ganeshan and Harrison, is: “A supply 
chain is a network of facilities and distribution options that performs the functions of 
procurement of materials, transformation of these materials into intermediate and 
finished products, and the distribution of these finished products to customers”. 
Supply chain can be considered as a series of processes that change and add value to a 
product as it moves down the chain. 

The concept of SCM is relatively new to academics and practitioners, 
appearing first in 1982 (Cooper, Lamber, & Pagh, 1997). Although the term, supply 
chain management, has been used since the 1980s and the academic and trade presses 
have given extensive attention to the concept, confusion still persists in defining what 
is SCM (Bechtel & Jayaram, 1997). Many researchers have tried to define the 
meaning of SCM. Table 2-2 provides the summary of each author's definitions. 
Although subtle differences are found in the word choice and expression, 
commonalities contribute to an understanding of core concepts in the definition of 
SCM. The first component is the range of participants. All of the definitions in Table 

2.1 state that all channel members within a company or between companies, including 
supplier, manufacturer, distributor, and customer, should be involved in the chain 
activities and collaboration between members. The second component is the flow of 
materials and information. Agreement across definitions is that materials, whether raw 
materials or finished goods, and information flow simultaneously both upstream and 
downstream in the chain. Third, to manage the flow of materials and information and 
to provide high customer value, integrated and coordinated value-added activities are 
required (i.e., cross-functional approach, joint planning and forecasting, flexible 
operations). 
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Authors 

Definition 

Alber & Walker (1997) 

"The global network used to deliver products and services 

from raw materials to end customer? through an 

i 

engineered flow of information and physical distribution." 

(p. 203) 

Cooper & Ellram (1993) 

"An integrative philosophy to manage the total flow of a 
distribution channel from the supplier to the ultimate user 
... greater coordination of business processes and 
activities ... across the entire channel and not just between 
a few channel pairs." (p. 13) 

Giunipero & Brand 

(1996) 

"Definitions can be grouped into three major categories: 1) 
The management of the flow of goods from supplier to 
final user; 2) The system-wide coordination of product and 
information flows; and 3) The development of 
relationships and the integration of all activities that 
provide customer value throughout the distribution 
channel." (pp. 29-30) 

Jones (as cited in Goffin, 
Szwejczewski, & New, 
1997) 

"Managing the entire chain of raw material supply, 
manufacture, assembly, and distribution to the end 
customer." (p. 422) 

Lee & Ng (1997) 

"A network of entities that starts with the suppliers' 
suppliers and ends with the customers' customers for the 
production and delivery of goods and services." (p. 191) 

Lummus, Vokurka, & 
Alber (1998) 

"A network of entities through which material and 
information flow. Those entities include suppliers, 
carriers, manufacturing sites, distribution centers, retailers 
and customers." (p. 49) 

Palevich (1997) 

"All of those activities associated with moving goods from 
raw materials through the end user: sourcing and 
procurement, production scheduling, order processing, 
inventory management, transportation, warehousing, and 
customer service. Importantly, it also embodies the 
information systems to monitor these activities." (p. 1) 

Spekman, Kamauff, & 
Myhr (1998) 

"A process for designing, developing, optimizing, and 
managing the internal and external components of the 
supply system, including material supply transforming, 
materials and distributing finished products or services to 
customers, that is consistent with overall objectives and 
strategies." (p. 631) 

TICCE 

(as cited in Cooper, 
Lambert, & Pagh, 1997) 

"The integration of business processes from end user 
through original 

suppliers that provides products, services and information 
that add value for customers." (p. 2) 


Table 3.1 Definition in Supply chain Management 
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3.2 Supply Chain Decisions 


The decisions for supply chain management can be classified into two broad 
categories — strategic and operational. As the term implies, strategic decisions are 
made typically over a longer time horizon. These are closely linked to the corporate 
strategy, and guide supply chain policies from a design perspective. On the other 
hand, operational decisions are short term, and focus on activities over a day-to-day 
basis. The effort in these types of decisions is to effectively and efficiently manage 
the product flow in the "strategically" planned supply chain. 

There are four major decision areas in supply chain management: 1) location, 
2) production, 3) inventory and 4) transportation (distribution), and there are both 
strategic and operational elements in each of these decision areas. 

3.2.1 Location Decisions 

The geographic placement of production facilities, stocking points, and 
sourcing points is the natural first step in creating a supply chain. The location of 
facilities involves a commitment of resources to a long-term plan. Once the size, 
number, and location of these are determined, so are the possible paths by which the 
product flows through to the final customer. These decisions are of great significance 
to a firm since they represent the basic strategy for accessing customer markets, and 
will have a considerable impact on revenue, cost, and level of service. These decisions 
should be determined by an optimization routine that considers production costs, 
taxes, duties and duty drawback, tariffs, local content, distribution costs, production 
limitations, etc. Although location decisions are primarily strategic, they also have 
implications on an operational level. 

3.2.2 Production Decisions 

The strategic decisions include what products to produce, and which plants to 
produce them in, allocation of suppliers to plants, plants to DC'Sj, and DC's to 
customer markets. As before, these decisions have a big impact on the revenues, costs 
and customer service levels of the firm. These decisions assume the existence of the 
facilities, but determine the exact path(s) through which a product flows to and from 
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these facilities. Another critical issue is the capacity of the manufacturing facilities 
and this largely depends the degree of vertical integration within the firm. Operational 
decisions focus on detailed production scheduling. These decisions include the 
construction of the master production schedules, scheduling production on machines, 
and equipment maintenance. Other considerations include workload balancing, and 
quality control measures at a production facility. 

3.2.3 Inventory Decisions 

These refer to means by which inventories are managed. Inventories exist at 
every stage of the supply chain as either raw material, semi-finished or finished 
goods. They can also be in-process between locations. Their primary purpose to 
buffer against any uncertainty that might exist in the supply chain. Since holding of 
inventories can cost anywhere between 20 to 40 percent of their value, their efficient 
management is critical in supply chain operations. It is strategic in the sense that top 
management sets goals. However, most researchers have approached the management 
of inventory from an operational perspective. These include deployment strategies 
(push versus pull), control policies, the determination of the optimal levels of order 
quantities and reorder points, and setting safety stock levels, at each stocking location. 
These levels are critical, since they are primary determinants of customer service 
levels. 

3.2.4 Transportation Decisions 

The mode choice aspects of these decisions are the more strategic ones. These 
are closely linked to the inventory decisions, since the best choice of mode is often 
found by trading-off the cost of using the particular mode of transport with the 
indirect cost of inventory associated with that mode. While air shipments may be fast, 
reliable, and warrant lesser safety stocks, they are expensive. Meanwhile shipping by 
sea or rail may be much cheaper, but they necessitate holding relatively large amounts 
of inventory to buffer against the inherent uncertainty associated with them. Therefore 
customer service levels and geographic location play vital roles in such decisions. 
Since transportation is more than 30 percent of the logistics costs, operating 
efficiently makes good economic sense. Shipment sizes (consolidated bulk shipments 
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versus Lot-for-Lot), routing and scheduling of equipment are key in effective 
management of the firm's transport strategy. 


3.3 Supply Chain Benefits 

Many previous studies conducted in various industries have revealed tangible 
benefits generated from efficient SCM (Harrington, 1999; Higginson & Alam, 1997; 
Alber & Walker, 1997; Palevich, 1997; Giunipero & Brand, 1996; Cooper & Ellram, 
1993). The summary of benefits is presented in Table 2.2. These benefits can be 
categorized into four groups. First, financial benefits were reported: reduction in costs 
tied with high level of inventory, shipping, and operating costs; cost advantage over 
competitors; and increased profit margin with lower product costs. These cost 
reductions were achieved without downsizing, laying off employees, or closing 
plants. Second, companies' operational activities were improved: reduced cycle times, 
lower inventory levels, increased stock availability, less stockouts, increased 
inventory turns, and greater productivity in operations. Third, customer service was 
increased: more reliable delivery and increased responsiveness to changes. Lastly, 
closer coordination among channel members is an important benefit. This benefit 
results in an improvement in the quality of products and information and an increase 
in the sharing of expertise and risks, which creates a competitive advantage and 
greater profitability. 
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Benefits of 
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& Allram 
(1997) 

Palevich 

(1997) 
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52 
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of the 
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work 
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analysis 
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al 

work 
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with chain 
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* 

* 


* 

* 

Cost 

advantage 




* 



Cost reduction 



* 

* 


* 

Customer 
service level 
improvement 

* 






Cycle times 
reduction 

* 

* 

* 


* 

1 

* 

Inventory 

reduction 



* 

* 

* * 


Inventory 

turns 

improvement 

* 

* 





Productivity 

improvement 



* 




Profit margins 
improvement 

* 






Reliable 

delivery 



* 


* 


Responsivene 
ss to changes 




* 




Table 3.2 Benefits of supply chain Management 
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3.4 Element of supply chain management (Watson, E., Class Notes, 
Louisiana state university) 

The six basic elements to the strategic and operational management of the 
supply chain are 

1. Plan (Planning) 2.Buy (Purchasing) 3. Make(Manufacturing) 

4. Move (Distribution) 5. Sell (Marketing) 6. Consume (Consumption) 


a. Plan (Demand and supply planning): 

The planning of, synchronization and deployment of products and services 
across the entire supply chain to meet both operational needs and customer demands. 

b. Buy (Sourcing and Supplier Management): 

This involves buying of best materials at the lowest cost. When choosing a 
supplier, focus should be on developing velocity, quality and flexibility while at the 
same time reducing costs or maintaining low cost levels. In short, strategic decisions 
should be made to determine the core capabilities of a facility and outsourcing 
partnerships should grow from these decisions. 

c. Make (Manufacturing and Operations): 

For product manufacturers, new strategies such as lean manufacturing, work 
flexibility, and configure/make-to-order manufacturing are the keys to quantum 
improvements in cost, speed and quality. Likewise, service sector companies are 
streamlining operations and deploying new technologies to provide more services 
faster and at a lower cost. 

d. Move (Transportation and Distribution): 

Transportation, distribution and warehousing are developing into value-added 
services through new methods, such as flow optimization, cross-docking, 
consolidation, non-stop logistics, and tracking and racing systems. 
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e. Sell (Customer and Order Management): 

Customer and order management is where; building on other operational 
improvements, supply chain differentiation becomes visible to customers. In support 
of sales and marketing, clients can create revenue-generating competitive 
differentiation in areas including channel management, product and channel 
segmentation, product-service bundling, order management and configuration and 
setting and achieving targeted and customer service levels. 

3.5 Importance of supply chain efficiency (Watson, E., 2001) 

1. Improved Customer Service 

This Often means the difference between success and failure for companies. If a 
"customer" is seeking your product and it is not available when he/she wants it, the 
customer will purchase someone else's. So, having the right product at the right place 
at the right time is one way to define "customer service." 

2. Save Money/Reduce costs 

This can be defined in many ways, but in its broadest sense, it includes reducing the 
cost of getting the product to market. In other words, containing all costs associated 
with solving the product through the supply-chain. And this usually results in a more 
time-efficient supply chain as well. According to a recent benchmarking study 
conducted by Pittiglio Rabin Todd & McGrath (PRTM), one of the fpunders of the 
Supply-Chain Council, best in class companies have an advantage in total supply 
chain management cost of 3 to 6 percent of revenue. (Total supply chain management 
cost is the sum of Order Management, Material Acquisition, Inventory Carrying, and 
Supply-Chain Finance, lanning, and MIS Costs. 
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3.6 Implementing Supply Chain Management (Watson, E., 2001) 


• Need to provide extra service to some customers. 

• Corporate initiatives aimed at improving on-time delivery. 

• Mandates to reduce costs to enable more aggressive pricing. 

• Demands for faster delivery of customized products. 

• De-coupling of production and storage capabilities through transportation and 
storage capabilities. 

• Focus shifted from process optimization to logistics. 

• | Manufacturing ~ 45% | Marketing ~ 25% | Logistics ~ 20% | Profits | 

• More instances of multi-site manufacturing where several independent entities are 
involved in the production and delivery process. 

• Increasingly cutthroat marketing and distribution channels. 

• Maturation of the world economy and heightened demand for “local” products. 

• Competitive pressures to provide exceptional customer service. 

• Channel partnership is required to implement just-in-time, quick response strategies. 

3.7 Characteristics of Supply Chain Management (Watson, E., 2001) 

• Ability to source raw material or finished goods from anywhere in the world. 

• Centralized global business and management strategy with local execution. 

• On-line, real-time distributed information processing to the desktop, providing total 
supply chain information visibility. 

• Ability to manage information not only within a company but also across industries 
and enterprises. 

• Seamless integration of SCN processes, third-party suppliers, IS cost accounting 

standards and measurements systems. i 

• Reconfiguration of the supply chain organization into high-performance teams. 

3.8 Advantages Of Supply Chain Management (Watson, E., 2001) 

• Lower total delivered costs and lead times 

• Improved trading partner relationships and value 
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• Improved inventory performance in both cost and velocity 

• Improved transportation performance in cost, speeds, and service 

• Lower break-even costs and times 

• Increased revenues 

• Increased flexibility/visibility/responsiveness 

• Improved customer service and value 

• Gained overall competitive advantage/share I 

• Improved shareholder value 

3.9 Seven Principles Of Supply Chain Management (Watson, E.,2001) 

1. Segment customers based on service needs 

Companies traditionally have grouped customers by industry, product, or trade 
channel and then provided the same level of service to everyone within a segment. 
Effective supply-chain management, by contrast, groups customers by distinct service 
needs— regardless of industry— and then tailors services to those particular segments. 

2. Customize the logistics network 

In designing their logistics network, companies need to focus intensely on the service 
requirements and profitability of the customer segments identified. The conventional 
approach of creating a "monolithic" logistics network runs counter to successful 
supplychain management. 

3. Listen to signals of market demand and plan accordingly 

Sales and operations planning must span the entire chain to detect early warning 
signals of changing demand in ordering patterns, customer promotions, and so forth. 
This demand-intensive approach leads to more consistent forecasts and optimal 
resource allocation. 
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4. Differentiate products closer to the customers 


Companies today no longer can afford to stockpile inventory to compensate for 
possible forecasting errors. Instead, they need to postpone product differentiation in 
the manufacturing process closer to actual consumer demand. 

5. Strategically manage the source of supply 

By working closely with their key suppliers to reduce the overall costs of owning 
materials and services, supply-chain management leaders enhance margins both for 
themselves and their suppliers. Beating multiple suppliers over the head for the lowest 
price is out, Andersen advises. "Gain sharing" is in. 

6. Develop a supply-chain-wide technology strategy 

As one of the cornerstones of successful supply-chain management, information 
technology must support multiple levels of decision making. It also should afford a 
clear view of the flow of products, services, and information. Develop a supply-chain- 
wide technology strategy. 

7. Adopt channel-spanning performance measures. 

Excellent supply-chain measurement systems do more than just monitor does internal 
functions. They adopt measures that apply to every link in the supply chain. 
Importantly, these measurement systems embrace both service and financial metrics, 
such as each account's true profitability. 



CHAPTER 4 


PROBLEM DESIGN AND DATA COLLECTION 


Initially we downloaded patents related to field of supply chain management 
from USPTO and EPO using an initial set of keywords. Patents from different sources 
are in different HTML formats and scripts, so they cannot be used directly for further 
processing. Especially USPTO files, have unstructured format with lots of codes and 
scripts in HTML data. This requires parsing of these file so that they can be read in 
database. 

The standard methodology shown in figure 4.1 is used to parse the files in 
acceptable format. First tag file from unstructured file is crated and then this tag file is 
converted in to XML files. These file are then read into a database with specified field 


structure. 

A 

Untagging 

Program 

)x 

Tag 

XML .A 


HTML 

File 


Tag File 


XML file 


Figure 4.1: Parsing 

After the initial phase, which was patent search and conversion in to XML 
second phase followed. In the second phase Mirdha, Bhuvanesh (2004) used K 
nearest neighbor methodology for mapping of patents documents. 

The mapping was done for the purpose of classification of the patents in three 
level of hierarchy in various subclasses. Our attempt is to further improve the 
accuracy and precision of such classification. 

4.1 Metadata 

It has been stated in literature review (Ritcher, G., et al., 2004), that use of 
metadata improves accuracy. Metadata is commonly described as “data about data”. 
It is usually distinguished from the data itself through suitable mark-up or by being 
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stored in a physically different location. In the case of textual data, common types of 
metadata are author, the date of publication or keywords and descriptors. An 
important difference between (textual) data and metadata on the linguistic level is the 
fact that metadata elements become semantically meaningful through the “field” in 
which they occur, whereas textual data elements obtain their meaning through 
linguistic rules and their position within their context. 

4.1.1 Classification of Metadata 

In context of patent classification problem the metadata is all the relevant 
information that patent provides us along with Keywords. In our case we have taken 
following attribute to represent metadata 

o Keywords 
o Keyword-pair 
o Author name 
o Assignee name 
o I-class 
o U-class 
o Citation 

Keywords are further divided in to three classes, depending on its location. For 
the purpose of better classification we have to consider different weightage to 
Keywords depending upon the location. For example Keywords, which are in the 
Abstract and Title are likely to be more important, compared to Keywords, which are 
present in Claim part. 

The Keywords-pair are used because it is seen that certain pair of Keywords 
occurring in the same document are better classifier then the single Keyword. 

Author and Assignee name are being used in the classification as metadata 
because it is general perception that people and company like to work in certain area 
(represented by class), at a time. 
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I-class and U-class are found in every USPTO patent. The reason of using 
them, as metadata is that correlation has been observed in the attribute data and 
subclasses. It may be noted that for traditional Technology I-class and U-class are 
robust classifiers. However for business method and software patents systems it is not 
well developed. 

Patent citations are defined as the count of citations of a patent in subsequent 
patents, and citations per patent represent the relative importance of the patent. Based 
on this idea, patent citation analysis executes a bibliometrics analysis on patent 
documents. In essence, citation-based technique attempts to link patents in a patent 
database in the same way as science citation analysis links references in a scientific 
paper database (Karki, 1997). Ultimately, patent citation analysis produces such 
technological indices as citations per patent, highly cited patents, no patent link, 
technical impact index, current impact index, technology cycle time, and so forth. It is 
natural for the patent having similar nature to cite each other. 

4.2 Data Analysis 

Based on the various attributes discussed earlier, first step is to create vector 
for each patent document with all related attributes. The step-by-step methodology is 
discussed in following sections. 

4.2.1 Methodology 

There are three levels in hierarchy of SCM classification as the result of work 
done by Mirdha, Bhuvanesh (2004). In each level there are subclasses. The initial 
step is to find out important attributes, relevant to a subclass. The frequency with 
which an attribute appear in a subclass is taken as a measure of relevance. Table 4.1 
shows an example of collection of data with respect to keywords and frequency of 
their occurrence for different subclasses. 
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f 

Subclass 

Allocation 

4 

See 

Allocation 

2 

Sep 

Allocation policy 

2 

Sep 

Assembly line 

2 

See 

Automatically 

14 

See 

Automatically 

5 

Sep 


Table 4.1 Frequency distribution 


Here second column denotes number of times that keywords are repeated in 
particular subclass. By this we are able to identify which attribute is more significant 
in a particular subclass. Similar compilation has been done for all attributes for every 
subclass. 


4.2.2 Data cleaning 

Once data about frequency of attributes in subclass is complied, data is 
analyzed to identify those attributes, which are more frequent in a given subclass but 
are less frequent in other subclasses. These attributes are relevant for a document to 
be classified in a specific subclass. Table 4.2 shows an example of this compilation 


■H 

f 

Ratio 

No. Of 
Patents 

Subclass 


2 

0.035088 

57 

Decision support system: - 


16 

0.056537 

283 

See 


17 

0.22973 

74 

Sep 

Enterprise 

5 

0.108696 

46 

E-commerce 


7 

0.35 

20 

Strategic planning 


Table 4.2 Data cleaning 


As shown in Table 4.2, we compare ratio among the subclass, which are at 
same level. We neglect those attributes data for which frequency ratio is more or less 
equal to each other. The purpose of data cleaning is to reduce the data size and to 
identify important attributes for each subclass. Similar analysis is done for all other 
attributes. The results of this analysis are given in Annexure C-J. 
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4.2.3 Data Integration 


We have the data about relevant attributes. To do further analysis we have to 
assign weight to these attributes. After the data cleaning we have attributes value in 
form of ratio as discussed in section 4.2.2. 



Fig 4.2 Subclasses distribution 

In fig 4.2 a, b, c and d represents subclasses. While a, belong to level p of 
hierarchy other represent lower level (p+1). If an attribute occurs with the same 
frequency in a child node as in a parent node, then its value to discriminte the 
document in lower level subclasses is insignificant. Hence for each attributes or 
subclass level we consider the ratio of relative frequency in parent and child node. 


HjflBSSRSS^ 

f 

Subclass 

Ratio 1 

Ratio 2 

Enterprise 

16 

See 

0.05654 


Enterprise 

17 

Sep 

0.22973 



5 

E-commerce 

0.1087 

1.922554 


7 


0.35 

1.523529 

■nm 

7 

Bssc 

0.41176 

1.176471 


Table 4.3 Ratio 2 calculations 


Here as shown in Table 4.3 E-commerce, Strategic planning and Business 
strategy for supply chain are subclasses belonging to lower level while See and Sep 
belongs to higher level. Ratio 1 is ratio that we have found and discussed in section 
4.2.2.For a child subclass ratio 2 is calculated as 
Ratio 1 (Parent Node) 

Ratio 1 (Child Node) 
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Ratio 2 for b gives the probability of document in a (Parent node), being classified in 
lower level subclasses b. Similar ratio 2 for c, and d respectively gives the probability 
of document being classified in those subclasses b. If we use Ratio 2 as classifier it 
will not be good predictor, if it has similar value for all subclass at a given level. 


As, these ratio can have large range so we normalize these ratios. After the 
normalization the value of Ratio 2 will be between, 0 to 1 . For normalization we add 
values of all child subclass of patent (Ratio 2) together and divide the sum into the 
ratio of the subclass. For example for attribute ‘b’ the value after normalization will 
be given as 


Ratio3 b 


(Ratio! b ) 

( Ratio2 b + Ratio! e + Ratio! d ) 


I 



f 

Subclass 

Ratio 1 

Ratio 2 

1 After 

Normalization 
(Ratio 3) 
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See 

0.14841 


0.687401084 

HUB9MI 

5 

Sep 

0.06757 


0.312957701 

Inventory 

3 

Assembly plant management 

0.125 

0.842262 

0.094636169 

Inventory 

4 

Logistic management 

0.17391 

1.171843 

0.131667713 

Inventory 

2 

E-commerce 

0.04348 

0.292961 

0.032916928 


Table 4.4 Normalization table 

Figure 4.4 shows final value of attribute (here, keyword inventory) after 
normalization. 


4.2.4 Patent Attribute Vector 

Table 4.5 shows the groups and the numbers in each group of attributes, as 
well as the measure of the value in patent document. An attribute vector for each 
document is prepared. An example for such a document is shown in Table 4.6. 


Attributes groups 

Number of 
Attributes 

Measure 

Keyword Abstract 

66 

Frequency 

Keyword Claim 

73 

Frequency 

Keyword Title 

43 

Frequency 
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I-class 

60 

Presence/ Absence 

U-class 

94 

Presence/ Absence 

Author Name 

165 

Presence/ Absence 

Assignee Name 

14 

Presence/ Absence 


Table 4.5 Attribute Data 


Keyword 

Keyword 

Keyword 

Keyword 

I 

- 

u 

- 

Author 

Assignee 

Abstract 

Claim 

Title 

Pair 

class 

class 

Name 

Name 

Kwaj 

W 

KwCj 

0 

Kwtj 

0 

Kwpi 

0 

Is 

1 

Ui 

0 

Aunj 

0 

Asn; 

1 

Kwaj 

0 

Kwcj 

W ' 

Kwtj 

w 

Kwpj 

0 

Ij 

0 

Uj 

0 

Aim,- 

0 

Asnj 

0 

Kwa k 

W 

KwCk 

W 

Kwtk 

0 

Kwpk 

W 

EH 

□ 

Uk 

1 

Aunk 

1 

Asnk 

0 


Table 4.6 Attribute vector for a subclass 


For all these attributes we have their weight for each subclass, completed on 
the basis of present classification. For each class in classification, we prepare a vector 

of attributes, where weight is calculated in previous section as ratio 3,,is assigned to 

i 

the attribute. If an attribute is not present it assume zero value. Table 4.6 attributes 
vector for such vector in a subclass. 


4.2.5 Data mining technique 

Data mining techniques are those set of techniques, which are applied to 
extract potentially useful patterns. The purpose of these methods is to improve the 
accuracy of the patent classification. The techniques used here is to develop 
discriminant function, for each class such that it can calculate the document which are 
relevant to a class and differentiate relevant document from these which are not 
relevant. 

Attributes related to Keywords have weight in-between 0-1. For other 
attributes it is either 0 or 1. To classify a patent document, further in to subclass first 
of all every weight of each attributes is checked. For each of the subclass. If any of 
these weight(s) is equal to one for a particular subclass then, patent document is 
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directly classified in to this subclass. If none of the weight is found to be one for any 
of the subclass then a weighted function of attribute is developed. 

We have selected following function for the purpose. 

1. wi (Linear Weight) 

“f” is frequency of occurrence of i th attribute in a patent document. 

“w” is the weight of the attribute for the given class. 

2. ^ fi*wi 2 (Square Weight) 

3. (Indexed Weight) 

The indexed weights are assigned as follows: 


S. No 

Original weight range 

Modified weight 

1 . 

1 

1 


1-.9 

.9 

3. 

.9:6 

.7 

4. 

.3:6 

.5 

5. 

Less then .3 

0 


Table 4.7 Modified Weight 


4. Discriminant analysis (Statistical Technique) 

To choose the appropriate function we selected a sample of 25 documents 
from Supply chain planning subclass. Patent Document from SCP is further classified 
in to a) Strategic Planning (SP) b) Tactical Planning (TP) c) Operational Planning 
(OP). We have calculated value of all 25 documents for each of the subclass for these 
three functions. 

Every document in parent subclass value is determined by running functions 
shown above representing each of child subclass. Next step is to compare results for 
child subclasses whichever child subclass gives maximum value for a given function 
for a document that document get classified under that subclass. As, we have discussed 
that for each document we found its value of each subclass for every function. The 
example of calculation of value of function and thus subclass determination is shown 
below. 
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Patent No. 

Function 

SP 

TP 

OP 

Manual 

Reading 

Status 

5.xml.txt 

fl 

2.3465314 

0.641733 

0.5433 

SP 

Correct 

f2 

1.493742 

0.134683 

O' 1587 

SP 

Correct 

f3 

2.1 

0.3 

0.9 

SP 

Correct 





0 

TP 

Correct 




0 

TP 

Correct 

f3 

0.2 

3.1 

0 

SP 

Wrong 


Table 4.8 Calculation of function value 


The complete table of classification of document is shown in table 4.9. Here 
each cell denote the status of patent document according to function value. Last 
column denotes the correct status as found out by manual reading of document. The 
irrelevant documents are those which aren’t related to supply chain management 
patent. The detailed result has been shown in Annexure K. 



function 

function 

function 

Manual 

Patent No. 

1 

2 

3 

Reading 

1.xml.txt 

sp 

irrelevent 

sp 

irrelevent 

2.xmi.txt 

. ... S P - 

sp 

SP 

sp 

3.xml.txt 

op 

tp 

MSB! 

op 

4.xml.txt 

tp 

sp 

sp 

tp .. 

5.xml.txt 

sp 

sp 

sp 

..sp 

6.xml.txt 


sp 

sp 

sp 

7.14.xml.txt 

sp 

tp 

sp 

sp 

7.21.xml.txt 

sp 

sp 

_ sp. 

op 

7.22.xmi.txt 

tp 

sp 

sp 

tp 

7.26.xml.txt 

irrelevent 

irrelevent 

sp 

irrelevent 

7.xml.txt 

sp 


sp 


8.xml.txt 

op 

■E9H 

sp 

op 

9.xml.txt 

sp 

— B 

sp 

irrelevent 

10.xml.txt 

sp 

sp _ 

sp 

• sp 

11.xml.txt 

sp 

sp 

CL 

</) 

SP 

12.xml.txt 

sp 

irrelevent 

sp 

irrelevent 

13.xml.txt 

irrelevent 

irrelevent 

op 

irrelevent 

14.xml.txt 

tp 

sp 

tp 

tp 

15.xml.txt 

sp 

tp 

sp 

sp 1 

16.xml.txt 

op 

op 

op 

op 

1287.xml.txt 

sp . ... 

sp 

sp 

irrelevent 

1293.xml.txt 

tp 

tp 

sp 

tp 

19-8.xml.txt 


sp 

sp 

sp 

19.xml.txt 

tp _. 

tp 

B9 


18.xml.txt 


sp _ 

sp 

sp 


Table 4.9 Status of patent document 
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At last, we have found out the number of correctly classified document given 
by each of the three functions. 


Function 

correct 

wrong 

function 1 

19 

6 

function 2 

14 

11 

function 3 

9 

16 


Table 4.10 Accuracy by functions 


Function 4(Standardized Discriminant Functions) 
After running query in SPSS we found following function 


Abstract 

1 

EAS 

-0.811 

Information 

-0.177 

Node 

0.304 

Planning engine 

-0.307 

Forecast 

5.137 

Location 

-2.215 

Schedule 

1.325 

claim 

j 


Planning engine 

0.159 

Computer implemented 

-0.286 

Location 

3.461 

Forecast 

-1.985 

Display 

1.935 

title 


Enterprise 

-0.193 


Table 4.1 1 Discriminating Function 


The result of running the discriminant function 


Patent No. 

Function 

value 

1.xml.txt 

-0.811 

10.xml.txt 

-0.5430468 

11.xml.txt 

-0.72617173 

12.xml.txt 

6.895 

1287.xml.txt 

1 .5822247 

1293.xml.txt 

0 

13.xml.txt 

1 .7556251 

14.xml.txt 

9.077225 

15.xml.txt 

-0.7829055 

16.xml.txt 

3.461 


Table 4.12 Function 4 Results 
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We haven’t used discriminant function for the calculation purpose as most of 
the words were found insignificant in the analysis also the plot that we obtained from 
this function wasn’t able to discriminant between various lower level classes. On 
running the results on 25 sample documents and analyzing the results following 
observation were made. 

o First two discriminant function produced similar result is most of the 
case. 

o Discriminant analysis gave very few significant words and their weight 
when discriminating function based on this used for classification 
purpose it was unable in classifying between documents belonging to 
different subclass. 

o Modified weight function at a few times produced similar value so it 
was not possible to classify document in either category. 

o The differences between values of weight as given by second function 
were less. 

o For the documents which are non-relevant, second function shown best 
result as, function value of these document are very small. 

Overall we found out that first function gives best result amongst all the 
four functions used. For classifying the relevant document into subclasses, we 
use this function for the purpose of classifying relevant document. Second 
function is purposed to be used for searching any non-relevant document. 
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Fig 4.3 classification of patent document 
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CHAPTER 5 


RESULTS AND DISCUSSION 


The application of methodology on patent documents for classification in 
various subclasses has produced results for those subclasses. For the purpose of 
effective representation of results we have used contingency table. In this table 
classification effectiveness is mainly measured using the notions of Precision (P) and 
Recall(R). Precision is defined as the probability that a document classified in a 
particular class is correctly classified and it belongs to that class. For a class Recall is 
defined as the probability that a document is correctly classified in that class. Both 
Precision and Recall are measured as shown in contingency table 5.1. 


Subclass 

Manually reading 

Yes 

No 

Results by Classifier 
( analysis) 

Yes 

TPi 

FPj 

No 

FN,; 

TN; 


Table 5.1 Contingency table 


In table 5.1, the number of documents that have been correctly classified under 
a subclass is called as True Positive (TP;). False Positive (FPj) are those documents 
that have been incorrectly classified under that subclass. True Negative (TNj) are 
those non relevant documents that have been classified correctly. False Negative (FN;) 
are incorrectly classified documents also known as Errors of Omission and indicate 
the incorrect prediction that an instance is negative. 

TP. . TP- 

Estimate of Precision (P) = — and Estimate of Recall (R) = — — — 

TP, + FPj TP i +FN i 


5.1 Level 0-1 

Here we have applied following function over the patents at level 0. Level 0 
means those patents which are found to be relevant but are not classified into next 
(First) sublevel. Here, we have to find out the next subclass from application of 
Analysis. 
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5.1.1 Supply Chain Planning 


Subclass: Supply Chain Planning 

Manual reading 

Yes 

No 

Results by Classifier 

Yes 

33 

13 

No 

5 

1 


P 


R 

.9704 


In this classification 33 is the value of TP scp in the documents, which are 
correctly classified by the Classifier (Analysis). The value of FP scp is 13 and FN scp is 
5. This means 13 documents classified by classifier are in wrong class (doesn’t 
represent class) and it has not classified 5 documents that should have been classified 
in SCP class. 1 represents that document which doesn’t belong to subclass and is 
being correctly classified as TN scp by both methods. P means Precision and R means 
Recall, there values are given as .71739 and .9704 respectively. 


5.1.2 Supply Chain Execution 


Subclass: Supply Chain Execution 

Manual reading 

Yes 

No 

Results by Classifier 

Yes 

71 

25 

No 

4 

10 



.74 


.9466 


In this classification 71 is the value of TP sce in the documents, which are 
correctly classified by the Classifier (Analysis). The value of FP sce is 25 and FN sce is 
4. This means 25 documents classified by classifier are in wrong class (doesn’t 
represent class) and it has not classified 4 documents that should have been classified 
in SCP class. 10 represent those documents which don’t belong to subclass and is 
being correctly classified as TN S ce by both methods. P means Precision and R means 
Recall, there values are given as .74 and .9466 respectively. . 


5.2 Level 1-2 

After finding out subclasses SCE and SCP. Next step is to further classify 
them into lower level subclasses. SCP is to be further classified into three subclasses 
1.) Operational Planning 2.) Strategic Planning 3.) Tactical Planning. 
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5.2.1 Operational Planning 


Subclass: Strategic Planning 

Manual reading 

Yes 

No 

Results by Classifier 

, 

Yes 

r 11 

3 

No 

0 

3 


P 

.785 

R 

1 


In this classification 1 1 is the value of TP op in the documents, which are 
correctly classified by the Classifier (Analysis). The value of FP op is 3 and FN op is 0. 
This means 3 documents classified by classifier are in wrong class (doesn’t represent 
class) and it has not classified any document that should have been classified in SCP 
class. 3 represent those documents which don’t belong to subclass and is being 
correctly classified as TN op by both methods. P means Precision and R means Recall, 
there values are given as .785 and 1 respectively. 


5.2.2 Strategic Planning 


Subclass: Strategic Planning 

Manual reading 

Yes 

No 

Results by Classifier 

Yes 

10 

6 

No 

0 

6 


P 

.625 

R 

1 


In this classification 10 is the value of TP sp in the documents, which are 
correctly classified by the Classifier (Analysis). The value of FP sp is 6 and FN sp is 0. 
This means 6 documents classified by classifier are in wrong class (doesn’t represent 
class) and it has not classified any document that should have been classified in SCP 
class. 6 represent those documents which don’t belong to subclass and is being 
correctly classified as TN sp by both methods. P means Precision and R means Recall, 
there values are given as .785 and 1 respectively. 


5.2.3 Tactical Planning 


Subclass: Tactical Planning 

Manual reading 

Yes 

No 

Results by Classifier 

Yes 

12 

2 

No 

0 

0 


P 

.86 

R 

1 


54 











In this classification 12 is the value of TPtp in the documents, which are 
correctly classified by the Classifier (Analysis). The value of FP tp is 2 and FNtp is 0. 
1 his means 2 documents classified by classifier are in wrong class (doesn’t represent 
class) and it has not classified any document that should have been classified in SCP 
class. 0 represent that no document belongs to the subclass and is being correctly 
classified as TNt p by both methods. P means Precision and R means Recall, there 
values are given as .86 and 1 respectively. 

In the similar manner SCE patent documents can be further classified into 
their subclasses at level 2. Here the classification is done into 

o Assembly Plant Management 
o Decision Support System 
o E-commerce 
o Information System 
o Inventory Management 
o Logistic Management 
o Manufacturing Control Facility 
o Miscellaneous 
o Transportation Management 

5.2.4 Assembly Plant management 


In this classification 10 is the value of TP apm in the documents, which are 
correctly classified by the Classifier (Analysis). The value of FP apm is 10 and FN apm is 
0. This means 10 documents classified by classifier are in wrong class (doesn’t 
represent class) and it has not classified any document that should have been 
classified in SCP class. 3 represent those documents which don’t belong to subclass 
and is being correctly classified as TN apm by both methods. P means Precision and R 
means Recall, there values are given as .5 and 1 respectively. 
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5.2.5 Decision Support System 



In this classification 1 1 is the value of TP dss in the documents, which 
are correctly classified by the Classifier (Analysis). The value of FP dss is 6 and FN dss 

is 1. This means 6 documents classified by classifier are in wrong class (doesn’t 

! 

represent class) and it has not classified 1 document that should have been classified 
in SCP class. 2 represent those documents which don’t belong to subclass and is being 
correctly classified as TN dss by both methods. P means Precision and R means Recall, 
there values are given as .65 and .92 respectively. 

5.2.6 E-Commerce 



In this classification 31 is the value of TP e - C om in the documents, which are 
correctly classified by the Classifier (Analysis). The value of FP e . C om is 13 and FN e . C om 
is 0. This means 13 documents classified by classifier are in wrong class (doesn’t 
represent class) and it has not classified any document that should have been 
classified in SCP class. 5 represent those documents which don’t belong to subclass 
and is being correctly classified as TN e . C om by both methods. P means Precision and R 
means Recall, there values are given as .7 and 1 respectively. 


5.2.7 Information System 
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In this classification 8 is the value of TPj nf sys in the documents, which are 
correctly classified by the Classifier (Analysis). The value of FP infsys is 3 and FN infsys 
is 2. This means 3 documents classified by classifier are in wrong class (doesn’t 
represent class) and it has not classified 2 documents that should have been classified 
in SCP class. 1 represents that document which doesn’t belong to subclass and is 
being correctly classified as TN; n f sys by both methods. P means Precision and R 
means Recall, there values are given as .72 and .8 respectively. 

5.2.8 Logistic Management 


Subclass: Logistic Management 

Manual reading 

Yes 

No 

Results by Classifier 

Yes 

8 

3 

No 

0 

1 


P 

.72 

R 

1 


In this classification 8 is the value of TPi og man in the documents, which are 
correctly classified by the Classifier (Analysis). The value of FPi ogma n is 3 and FNi og 
man is 0. This means 3 documents classified by classifier are in wrong class (doesn’t 
represent class) and it has not classified any document that should have been 
classified in SCP class. 1 represents that document which doesn’t belong to subclass 
and is being correctly classified as TNi og man by both methods. P means Precision and 
R means Recall, there values are given as .72 and .8 respectively. 

5.2.9 Manufacturing Control Facility 


Subclass: Manufacturing control Facility 

Manual reading 

Yes 

No 

Results by Classifier 

Yes 

8 

2 , 

No 

0 

2 ' 


P 

.8 

R 

1 


In this classification 8 is the value of TP ma n con fac in the documents, which are 
correctly classified by the Classifier (Analysis). The value of FP man con fac IS 2 311(1 
FN man eon f ac is 0. This means 2 documents classified by classifier are in wrong class 
(doesn’t represent class) and it has not classified any document that should have been 
classified in SCP class. 2 represent those documents which don’t belong to subclass 
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and is being correctly classified as TN man con fac by both methods. P means Precision 
and R means Recall, there values are given as .8 and 1 respectively. 


5.3 Level 2-3 

Strategic planning is further classified into 

o Business Strategy for Supply chain 
o Enterprise and site planning. 

Tactical Planning is further classified into 

o Manufacturing Planning and Scheduling 
o Available to promise 
o Inventory planning 

5.3.1 Business Strategy for Supply Chain 


Subclass: Business Strategy for Supply 
chain 

Manual reading 

Yes 

No 

Results by Classifier 
( analysis) 

Yes 

5 

4 ^ 

No 

1 

1 


P 

.55 

R 

.83 


In this classification 5 is the value of TPb SS c in the documents, which are 
correctly classified by the Classifier (Analysis). The value of FPbssc is 4 and FNb SSC is 
1. This means 4 documents classified by classifier are in wrong class (doesn’t 
represent class) and it has not classified 1 document that should have been classified 
in SCP class. 1 represents that document which doesn’t belong to subclass and is 
being correctly classified as TN b ssc by both methods. P means Precision and R means 
Recall, there values are given as .55 and .83 respectively. 


5.3.2 Enterprise and Site Planning 


Subclass: Enterprise and site planning 

Manual reading 

Yes 

No 

Results by Classifier 

Yes 

3 

5 

No 

0 

2: 


P 

.375 

R 

1 
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In this classification 3 is the value of TP esp in the documents, which are 
correctly classified by the Classifier (Analysis). The value of FP esp is 5 and FN esp is 0. 
This means 5 documents classified by classifier are in wrong class (doesn’t represent 
class) and it has not classified any document that should have been classified in SCP 
class. 2 represent those documents which don’t belong to subclass and is being 
correctly classified as TN esp by both methods. P means Precision and R means Recall, 
there values are given as .375 and 1 respectively. 

5.3.3 Manufacturing Planning and Schedule 


Subclass: Manufacturing Planning and 
Schedule 

Manual reading 

Yes 

No 

Results by Classifier 

Yes 

2 

1 1 

No 

2 

2 


P 

.666 

R 

.5 


In this classification 2 is the value of TP esp in the documents, which are 
correctly classified by the Classifier (Analysis). The value of FP esp is 1 and FN esp is 2. 
This means 2 documents classified by classifier are in wrong class (doesn’t represent 
class) and it has not classified 1 document that should have been classified in SCP 
class. 2 represent those documents which don’t belong to subclass and is being 
correctly classified as TN esp by both methods. P means Precision and R means Recall, 
there values are given as .666 and .5 respectively. 

5.3.4 Available to Promise 


Subclass: Available to Promise 

Manual reading 

Yes 

No 

Results by Classifier 

Yes 

2 

3 

No 

0 

2 


P i 

.4 

R 

1 


In this classification 2 is the value of TP atp in the documents, which are 
correctly classified by the Classifier (Analysis). The value of FP a tp is 3 and FN a t p is 0. 
This means 3 documents classified by classifier are in wrong class (doesn’t represent 
class) and it has not classified any document that should have been classified in SCP 
class. 2 represent those documents which don’t belong to subclass and is being 
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correctly classified as TN atp by both methods. P means Precision and R means Recall, 
there values are given as .4 and 1 respectively. 

5.3.5 Inventory Planning 


Subclass: Inventory Planning 

Manual reading 

Yes 

No 

Results by Classifier 
(Analysis) 

Yes 

3 

1 

No 

1 

2 


P 

.75 

R 

.75 


In this classification 3 is the value of TPi P in the documents, which are 
correctly classified by the Classifier (Analysis). The value of FPj P is 1 and FN; P is 1. 
This means 1 document classified by classifier is in wrong class (doesn’t represent 
class) and it has not classified 1 document that should have been classified in SCP 
class. 2 represent those documents which don’t belong to subclass and is being 
correctly classified as TNj P by both methods. P means Precision and R means Recall, 
there values are given as .75 and .75 respectively. 


The documents, which are belonging to SCE, are already being classified up to 
level 2. There is Function that is to be run on subclasses of Level 2 for the required 
results. 

The classifications are as follow: - 


Assembly Plant Management 

o Assembly System Management 
o Assembly Line Management 
o Assembly Integration 


E-Commerce 

o Enterprise Security 
o Electronic Fund Transfer 
o Network Based Commerce 
o Web Commerce 


I 
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o Online Shopping 


Logistic Management 

o Logistic Control 
o Logistic Operation Management 

Decision Support system 

o Manufacturing Logistic Decision support system 
o Best to do match 
o Management Training System 

Information Systems 

o Inventory information system 
o Freight Distribution System 
o Supply chain network information system 
o Asset Tracking system 

Manufacturing Control Facility 

o Manufacturing Control Station 
o Manufacturing Monitoring 

Miscellaneous is classified as Supply Chain Financial Management and 

Transportation Management is classified as Fleet Management. And inventory 

Management as Inventory control. 

5.3.6 Assembly Line Management 


Subclass: Assembly System Management 

Manual reading 

Yes 

No 

Results by Classifier 
( analysis) 

Yes 

11 

7 

No 

0 

2 


P 

.61 

R 

1 


In this classification 1 1 is the value of TP a i m in the documents, which are 
correctly classified by the Classifier (Analysis). The value of FP a im is 7 and FN a im is 0. 
This means 7 documents classified by classifier are in wrong class (doesn’t represent 
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class) and it has not classified any document that should have been classified in SCP 
class. 2 represent those documents which don’t belong to subclass and is being 
correctly classified as TN a i m by both methods. P means Precision and R means Recall, 
there values are given as .61 and 1 respectively. 

5.3.7 Electronic Fund Transfer 


Subclass: Electronic Fund Transfer 

Manual reading 

Yes 

No 

Results by Classifier 
(Analysis) 

Yes 

r 11 

13 

No 

0 

4 


P 

.45 

R 

1 


In this classification 1 1 is the value of TP e a in the documents, which are 
correctly classified by the Classifier (Analysis). The value of FP eft is 13 and FN eft is 0. 
This means 13 documents classified by classifier are in wrong class (doesn’t represent 
class) and it has not classified any document that should have been classified in SCP 
class. 4 represent those documents which don’t belong to subclass and is being 
correctly classified as TN e a by both methods. P means Precision and R means Recall, 
there values are given as .45 and 1 respectively. 


5.3.8 Network Based Commerce 


Subclass: Network based commerce 

Manual reading 

Yes 

No 

Results by Classifier 
( analysis) 

Yes 

13 

8 

No 

0 

4 


P 

.619 

R 

1 


In this classification 13 is the value of TP nbc in the documents, which are 
correctly classified by the Classifier (Analysis). The value of FP n bc is 8 and FN n b C is 0. 
This means 8 documents classified by classifier are in wrong class (doesn’t represent 
class) and it has not classified any document that should have been classified in SCP 
class. 4 represent those documents which don’t belong to subclass and is being 
correctly classified as TN nb c by both methods. P means Precision and R means Recall, 
there values are given as .619 and 1 respectively. 
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5.3.9 Web commerce 


Subclass: Web commerce 

Manual reading 

Yes 

No' 

Results by Classifier 
( analysis) 

Yes 

3 

4 

No 

1 

2 


P 

.43 

R 

.75 


In this classification 3 is the value of TP WC in the documents, which are 
correctly classified by the Classifier (Analysis). The value of FP WC is 4 and FN WC is 1. 
This means 4 documents classified by classifier are in wrong class (doesn’t represent 
class) and it has not classified 1 document that should have been classified in SCP 
class. 2 represent those documents which don’t belong to subclass and is being 
correctly classified as TN WC by both methods. P means Precision and R means Recall, 
there values are given as .43 and .75 respectively. 

5.3.10 Logistic Control 


Subclass: logistic control 

Manual reading 

Yes 

No 

Results by Classifier 
( analysis) 

Yes 

2 

0 

No 

1 

0 


P 

1 

R 

.66 


In this classification 2 is the value of TPi c in the documents, which are 
correctly classified by the Classifier (Analysis). The value of FPi c is 0 and FNi c is 1. 
This means that classifier has not classified any document in wrong class (doesn’t 
represent class) and it has not classified 1 document that should have been classified 
in SCP class. 0 represent that no document belongs to the subclass and is being 
correctly classified as TNi c by both methods. P means Precision and R means Recall, 
there values are given as 1 and .66 respectively. 


5.3.11 Logistic Operation Management 


Subclass: logistic operation management 

Manual reading 

Yes 

No 

Results by Classifier 
(Analysis) 

Yes 

9 

10 

No 

0 

2 


P 

.47 

R 

1 
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In this classification 9 is the value of TP| 0m in the documents, which are 
correctly classified by the Classifier (Analysis). The value of FP, om is 10 and FNi oni is 
0. This means 10 documents classified by classifier are in wrong class (doesn’t 
represent class) and it has not classified any document that should have been 
classified in SCP class. 2 represent those documents which don’t belong to subclass 
and is being correctly classified as TNi om by both methods. P means Precision and R 
means Recall, there values are given as .47 and 1 respectively. 

5.3.12 Manufacturing Control Station 


Subclass: Manufacturing Control Station 

Manual reading 

Yes 

No 

Results by Classifier 
(Analysis) 

Yes 

7 

0 

No 

1 

2 


P 

1 

R 

.875 


In this classification 7 is the value of TP mcs in the documents, which are 
correctly classified by the Classifier (Analysis). The value of FP mcs is 0 and FN mcs is 
1 . This means that classifier has not classified any document in wrong class (doesn’t 
represent class) and it has not classified 1 document that should have been classified 
in SCP class. 2 represent those documents which don’t belong to subclass and is being 
correctly classified as TN mcs by both methods. P means Precision and R means Recall, 
there values are given as 1 and .66 respectively. 

5.3.13 Manufacturing Monitoring 


Subclass: Manufacturing Monitoring 

Manual reading 

Yes 

No 

Results by Classifier 
(Analysis) 

Yes 

4 

2 

No 

1 

0 


P 

.6666 

R 

.8 


In this classification 4 is the value of TP mm in the documents, which are 
correctly classified by the Classifier (Analysis). The value of FP mm is 2 and FN mm is 1. 
This means 2 documents classified by classifier are in wrong class (doesn’t represent 
class) and it has not classified 1 document that should have been classified in SCP 
class. 0 represent that no document belongs to the subclass and is being correctly 
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classified as rN mm by both methods. P means Precision and R means Recall, there 
values are given as .666 and .8 respectively. 

5.3.14 Inventory Information System 


Subclass: Inventory infonnation system 

Manual reading 

Yes 

No 

Results by Classifier 
(Analysis) 

Yes 

6 

4 

No 

0 

2 . 


P 

.6 

R 

i 


In this classification 6 is the value of TPjj S in the documents, which are 
correctly classified by the Classifier (Analysis). The value of FP iis is 4 and FN iis is 0. 
This means 4 documents classified by classifier are in wrong class (doesn’t represent 
class) and it has not classified any document that should have been classified in SCP 
class. 2 represent those documents which don’t belong to subclass and is being 
correctly classified as TNij S by both methods. P means Precision and R means Recall, 
there values are given as .6 and 1 respectively. 


5.3.15 Freight Distribution System 


Subclass: Freight Distribution System 

Manual reading 

Yes 

No 

Results by Classifier 
(Analysis) 

Yes 

1 

1 

No 

2 

0 


P 


R 

.3333 


In this classification 1 is the value of TP fds in the documents, which are 
correctly classified by the Classifier (Analysis). The value of FPf ds is 1 and FNf<j s is 2. 
This means 1 document classified by classifier is in wrong class (doesn’t represent 
class) and it has not classified 2 documents that should have been classified in SCP 
class. 0 represent that no document belongs to the subclass and is being correctly 
classified as TNfds by both methods. P means Precision and R means Recall, there 
values are given as .5 and .33 respectively. 
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5.3.16 Supply Chain Network Information System 


i 


Subclass: Supply chain network 
information system 

Manual reading 

Yes 

No 

Results by Classifier 
( analysis) 

Yes 

10 

6 

No 

0 

2 


P 

.625 

R 

1 


In this classification 10 is the value of TP scn j S in the documents, which are 
correctly classified by the Classifier (Analysis). The value of FP scn is is 6 and FN scni s is 
0. This means 6 documents classified by classifier are in wrong class (doesn’t 
represent class) and it has not classified any document that should have been 
classified in SCP class. 2 represent those documents which don’t belong to subclass 
and is being correctly classified as TN scnis by both methods. P means Precision and R 
means Recall, there values are given as .625 and 1 respectively. 

5.3.17 Asset Tracking System 


Subclass: Asset tracking system 

Manual reading 

Yes 

No 

Results by Classifier 
(Analysis) 

Yes 

8 

3 

No 

0 

1 


P 

.7272 

R 

1 


In this classification 8 is the value of TP ats in the documents, which are 
correctly classified by the Classifier (Analysis). The value of FP ats is 3 and FN ats is 0. 
This means 3 documents classified by classifier are in wrong class (doesn’t represent 
class) and it has not classified any document that should have been classified in SCP 
class. 1 represents that document which doesn’t belong to subclass and is being 
correctly classified as TN ats by both methods. P means Precision and R means Recall, 
there values are given as .7272 and 1 respectively. 

5.3.18 Best To Do Match 


Subclass: Best to do match 

Manual reading 

Yes 

No 

Results by Classifier 
(Analysis) 

Yes 

10 

13 

No 

2 

7 


P 

.43 

R 

.8333 
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In this classification 10 is the value of TP btm in the documents, which are 
correctly classified by the Classifier (Analysis). The value of FP btm is 13 and FN btm is 
2. This means 13 documents classified by classifier are in wrong class (doesn’t 
represent class) and it has not classified 2 documents that should have been classified 
in SCP class. 7 represent those documents which don’t belong to subclass and is being 
correctly classified as TN btm by both methods. P means Precision and R means Recall, 
there values are given as .43 and .8333 respectively. ! 

5.3.19 Management Training System 


Subclass: Management Training System 

Manual reading 

Yes 

No 

Results by Classifier 
(Analysis) 

Yes 

6 

9 

No 

1 

6 


mm 

£ 

■a 

Pll 


In this classification 6 is the value of TP mts in the documents, which are 
correctly classified by the Classifier (Analysis). The value of FP mts is 9 and FN mts is 1. 
This means 9 documents classified by classifier are in wrong class (doesn’t represent 
class) and it has not classified 1 document that should have been classified in SCP 
class. 6 represent those documents which don’t belong to subclass and is being 
correctly classified as TN m ts by both methods. P means Precision and R means Recall, 
there values are given as .4 and .86 respectively. 
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CHAPTER 6 


CONCLUSION AND SCOPE FOR FUTURE 

RESEARCH 


Patent documents acts as a source of Technical and commercial knowledge. It 
assists the firm in the process of decision making in the field of research and 
development, predicting market trends and its progress. There are large numbers of 
patents available in field like supply chain management. Data mining provides tool to 
analyze these patent. In this thesis we have developed Discriminant function as a tool 
for Data Mining. 

The method used in this thesis, takes in consideration the attributes present in 
the class which have larger function values as then assigned to the patent document. It 
assigns relative weight to the attribute, for a class used, based on these weights and 
frequency of occurrences for that attribute, calculating a value for these classes using 
discriminant function. We have seen that by using this metadata accuracy of method 
can be increased. We have identified a set of attributes, which can discriminant 
document in different classes. With the assistance of small manual sample, we have 
developed discriminant function for each class. 

Scope for future Research and limitation 

After analyzing results obtained by Discriminant function some observations 
which should be taken into consideration for further improving the classification axe 
as follows: 

• The number of keywords can be increased. As we go down in hierarchy of 
classification in lower level keywords becomes important in discriminating 
between subclasses. 

• We have left citation as a attribute in future research. Citation can also 
included as a attribute in developing discriminating function. 
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• Different weighting criteria can be tested for making a better discriminating 
function. 

• When in a subclass the number of documents is less (less sample size) the 
related discriminating function sometimes fail to classify documents 
appropriately. 

• For initial classification where documents are being classified as relevant and 
non-relevant for the 0 level, function 2 can be used, which is a better 
discriminator between relevant and non-relevant documents. 
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Claims 


What is claimed is: 

1 . A low profile electronic component comprising: 

a header having a recess formed within said header, said header forming a plurality of 
projections extending from said header and being generally parallel to said header, 
said projections being formed from the same material as the header; 

first and second opposing core members forming a core set, said core set being at least 
partially disposed within said recess; 

a pre-wound coil disposed at least partially within said core set such that the pre- 
wound coil is insertable into the first core member and held in place by the second 
core member, said pre-wound coil having a plurality of wire leads; and at least one of 
said wire leads being wrapped around one of said projections to form a conductive 
surface on said projection to form a component terminal. 

2. The low profile electronic component of claim I wherein said header and said core 
set are bonded together with an adhesive. 

3. 'fhe low profile electronic component of claim I further comprising a layer of 




solder disposed over at least a portion of said component terminal. 

4. I he low piofile electronic component of claim 1 wherein said pre- wound coil is 
comprised of at least two wires electromagnetically coupled together to form a 
transformer. 

5. The low profile electronic component of claim 1 wherein said pre-wound coil 
includes a wire coil to form an inductor. 

6. 1 he low profile electronic component of claim 1 wherein said projections extend 
outward generally parallel to said header allowing said component to form a surface 
mount component. 

7. I he low profile electronic component of claim 1 wherein said header is made from 
a plastic material capable of withstanding temperatures of at least 230.degree. C. 

8. I he electronic component of claim 1 wherein said first and second core members 
are made from a ferrite. 

9. I he low profile electronic component of claim 1 wherein said projections have a 
rectangular cross-section. 

1 0. 1 he low profile electronic component of claim 1 wherein said projections have an 
oval cross-section. 

1 1. The low profile electronic component of claim 1 wherein said projections have a 
trapezoidal cross-section. 

12. The low profile electronic component of claim 1 wherein said pre-wound coil is a 
self-supporting coil. 

13. The low profile electronic component of claim 1 wherein said recess extends 
entirely through said header. 

14. A low profile surface mount electronic component comprising: 

a flat header having a top and bottom surface and a plurality of edges, said header 
having a recess formed in a said top surface; 

a plurality of non-conductive projections extending from at least one of said edges of 
said header generally parallel to said bottom surface of said header, said projections 
being formed from the same material as the header; 

first and second core members forming a core set, said core set being at least partially 
disposed within said recess, wherein the first core member has a first surface and the 
second core member has a second surface parallel to and facing the first surface,; 


at least one self-supporting coil disposed at least partially within said core set between 
the first and second surfaces such that the combination of the first and second surfaces 
secure the self-supporting coil in place, each of said at least one self-supporting coils 



having at least one wire lead; and 


at least one of said wire leads being wrapped around one of said projections to form a 
conductive surface on said projection in order to form a surface mount terminal for 
the component. 

1 5. The low profile surface mount electronic component of claim 1 4 further 
comprising a layer of solder disposed on said conductive surface. 

16. The low profile surface mount electronic component of claim 14 wherein each of 
said projections is positioned perpendicular to the edge from which it extends. 

17. 1 he low profile surface mount electronic component of claim 16 wherein said 
projections are parallel to said bottom surface. 

1 8. The low profile surface mount electronic component of claim 14 wherein at least 
one of the core members defines an outer surface of the component. 
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Production scheduling: - 


Management training system: - 


1 Information system: - 

T ~ . 

0. 1 1 j 77 E-c ommerce: - 

0.03046 Logistic management: - 
0.0699 De cision support system: - 

0.4 Misc ellaneous: - 

__ 0.6 see 

1 see 

0.54069 see 

0.4595 sc_e 

0.14146 

0. 1 3285 Decision support system: - 
0.081 1 8 Informa tion system: - 
0 14146 sc ej 1 

6. 1 3K)3 ~1 

“0.10333' 

6.14853 Transport ation management: 
0.48985 Logistic management: - 

0.51 026 see 

0.6 sc e __ 

0,583 33 ~ 

0.63055 E-commerce: - 

1 Decision s upport system: - 
0.66666 1 


Network based commerce: - 

E-commerce system: - 

Logistic control: - 

Supply chain financial model: 


Asset tracking system: - 


Fleet management: - 

Logistic operation management: 



ANNEXURE E 


..All* * R cyw ord Cl aim and Ratio 3 Calculation 


KEYWORDS USED 

No. of 

documcn 

((Having 

keyword 

s) 

Subclass 

Total no. 
of 

documcn 

ts 

Weight 

Subclass 

Subclass 

Allocation 

6 

see 

283 

0.16397 



Allocation 

8 

sc p 

74 .„ 

0.8361 



Allocation 

2 

sc e 

24 

0.33333 

Assembly plant management: - 



Automatically 


Automatically 


Automatically 


4 Ass p mad 


2 tp 


14 0.66667 see 


7 lscr 


Inventory control: - 


I Assembly line management: - 


Manufacturing planning and schedulin 


Bar code 







24 0.40605 Assembly plant management: 


46 0.21 185| E-commerce:- 


57 0.17097 Decision support system: - 


46 0.2 11851 Information system: - 


1 see 


283 0.43966 


[Assembly line management: 



Assembly plant management: - 


46 0.246141 E-commerce: - I 


12 0.26959 [Transportation management: 


57 0.14189 


46 0. 140651 Information system: - 


20 1 0.28572 1 Strategic plannin 


■nmna§ 



8 

0.35938 

sc e 

Network based commerce: - 

3 

0.6389 

sc e 

Web commerce: - 



































































No. of 
documen 
((Having 
keyword 

KEYWORDS USED g) Subclass 


14 see 


iComputer implemente 


[Computer implemented 


[Computer implemented 


[Computer implemented 


Computer implemente 


Computer implemente 


Computer implemente 


Consumer 


Consumer 


Consumer 


Consumer 


Consumer 


Consumer 


Consumer 


Consumer 


Container 


Container 


Container 


Container 


Container 


Container 


Container 


Container 


Container 


Container 


Customer 


Customer 


Customer 


Customer 


Customer 


Customer 


Customer 


Customer 


Customer 


Customer 

Customer 


Customer 


Customer 


Customer order 


Customer order 


Customer order 


Customer order 


Customer order 


Data 


2 inf sys 


Total no. 
of 

documen 

Weight Subclass 


74 0.80406 


23 0.33803 


57 1 0.40921 Decision support system: 


46 0.25352 Information system:'- 


. 16 0.29421 Tactical plannin 


5 1 see 


12 0. 66666 fsc e ~ ' “ 


17 


283 0.58286 


24 0.23307 Assembly plant management: - 


0. 1 824 1 E-commerce: - 


23 0.2432 1 Logistic management: - 


57 0,09814 Decision support system: - 


2 so e 


2 Ass p ma 


2 Tr Man 


2 log man 


29 see 


8 


3 


2 1 sc e 


2 log man 


31 log man 


Subclass 


I Logistic operation management: 


lEBHBnOMI 


74 0.26759 


24 0.43707 Assembly plant management: ■ 




23 

0.19546 Logistic management:- 

57 

0.05258 Decision support system: - 

46 

0,06515 Infonnation system: - 


see 

Assembly system management: - 

sc e 

Assembly line management: - 

see 

Fleet management: - 

sc e 

Logistic control: - 


283 0.48797 


141 0.264 Inventory management: - 


24| 0, 102671 Assembly plant management: 


E-commerce: - 


23 0,2 14261 Logistic management: 


57 0.15131 Decision support system: 


46 0.13391 Infonnation system: - i 


16 0.5 1 Tactical plannin 


4l 0.68371 Isce 


15 0.27348lsce 


3 0.75 sc 


Assembly line management: - 


Logistic control: - 

Logistic operation management: 


Available to promise: - 


74 0.79305 I 

24 0.4 Assembly plant management: 


16 


Assembly line management: - 


,47703 

.52297 _ 

0.06546 Inventory management: - 


0.057281 Assembly plant management: 



























































































2 dss 


3 


5 dss i 


3infsys ! 


dinfsys 


4infsys 


man con i 


2 mis 


3 s 


0.3101 


0.20674 se e 


1 sc e 


0.48401 see 


0.51627 see 


0.23455 see 


0.43979 see 


0.32577 see 


0.29572 see 


0.50695 see 


0.19715 see 


3 

0.60809 

17 

0.39347 


Electronic fund transfer: - 


Network based commerce: - 


Web commerce: - 


E-commerce system: - 


Fleet management: - 


Logistic control: - 


Logistic operation management: - 


Manufacturing logistic decision suppoi 


Best-to-do-match: - 


Management training system: - 


Inventory information system: - 


Supply chain network information syst 


Asset tracking system: - 


Manufacturing control station: - 




Supply chain financial model: - 


Business strate 


Manufacturing planning and schedulin 


Available to promise: - 


Production schedulin 





Data element 


Data element 


Data element 


Data element 


Data element 


Database server 


Database server 


Database server 


0.2857 1 Assembly plant management: - 


0.33333 Strategic plannin 


1 see L 


m 



Assembly line management: - 




0.48937 Assembly plant management: - 


E-commerce: - 



Database server 


I Decision support syste 


i BBBB jgS E BSI 
























































































































E business 


E business 


EAS 


EAS 


EAS 


EAS 


EAS 


EAS 



Manufacturing planning and schedulin 


2831 0.40154 


46 0.4 E-commerce: - 

283 0.51454 


14 0.11 963 Inventory management: 


24 0. 1 163 1 Assembly plant management: ■ 


46 0.09305 E-commerce: - 



7 


14 sc e 


35 sc e 


28 sc e 


10 sc e 


c e 


4invman 


2 1 Ass p mai 


10 1 Ass p maJ 


2 Ass p mai 




12 0.10856 


23 0.11 328 Logistic management: - 


57 0. 1 1427 Decision support system: - 


46 0.11 328 Information system: - 


16 0.12314 Manufacturing control facili 


4 0.09851 Miscellaneous: - 


20 0.37602 Strategic plannin 


6 0.44444 sc e 


5 0.24787 see 


14 0.44262 see 


4 0.30984 see 




Inventory control: - 


I Assembly system management: 


[Assembly line management: - 


Assembly integration: - 






































































































0.55753 see 


0.44602 sc e 


5 0.28241 see 


0,35301 (see 


sc e 


sc e 


0.1366 see 


0.6 sc e 


0.4 sc e 


EAS 


EAS 


EAS 


EAS 


EAS 


Electronic commerce 


Electronic commerce 


17 

0.34738 

7 

0.30004 


Logistic control: - 


Logistic operation management: - 


Manufacturing logistic decision suppoi 


Best-to-do-match: - 


Management training system: - 


Inventory information system: - 


Freight distribution system: - 


Supply chain network information syst 


Asset tracking system: - 


Manufacturing control station: - 


Business strate 


Manufacturing planning and schedulin 

































































































































tlocumen 

t(Having 

keyword 

KEYWORDS USED s) 


Forecastin 


Forecastin 


Forecastin 


Forecastin 


Forecastin 




Information 


Information 


Information 


Information 


Information 


Information 


Information 


Information 


Information 


Information 


Information 


Information 


Information 


Information 




ce 


2 


8 


3 


2 


2 inv man 


7 Ass p mai 


3 E com 


3 E com 


8 log man 


6 0.25 see 


14 0.875 see 


12 0.40004 see 


8 0.60006 see 


15 0.8 see 


4 0.69444 see 



2 man con fl 


4 man con 1 






0.16802 see 


7 0.28803 see 


9 1 0.33385] sc e 


9 0.6677 II see 


Inventory control: - 


Assembly line management: - 


Electronic fund transfer: - 


Network based commerce: - 


Fleet management: - 


Logistic operation management: - 


Best-to-do-match: - 


Management training system: - 


Inventory information system: - 


Freight distribution system: - 


Supply chain network information sysl 


Asset tracking system: - 


Manufacturing control station: - 































































































































No. of 
documen 
t(Having 
keyword 


KEYWORDS USED s) 


Information 


Invento 


Invento 


Invento 


Invento 


Invento 


Invento 


Invento 


Invento 


I B3K9REHM 


IPA 


IPA 


IPA 


IPA 


IPA 


Item trackin 


Item trackin 


Item trackin 


Item trackin 


Item trackin 


Item trackin 


Location 


Location 


Location 


Location 


Location 


Location 


Location 


Location 


Location 


Location 


Location 


Location 


Location 


Location 


Location 


Location 


Location 


Location 


Total no. 
of 

documen 
Subclass ts 


7 

1 



74 

0.4642 

14 

0.4836 


3 inf svs 


4 Tr Man 


2 log man 


7 log man 


2 inf sys 


Weight Subclass 


i Subclass 


Assembly plant management: • 


46 0.07359 E-commerce: - 


23 0. 1471 81 Logistic management: - 


57 0.1 069 Decision support system: - 


46 0.13246 




8 0.57258 


6| 0.4|sce 




15 0.6 see 


6 0.60131 see 


Inventory control: - 


Assembly line management: - 


Logistic operation management: 


Inventory information system: - 


I Asset tracking system: - 


74 

0.73046 

283 

1 

14 

0.80282 


57 0.19718 Decision support system: - 


24 0.70377 Assembly plant management: - 


57 0.29632 


sc e 




Assembly line management: - 


283 


23 0.42502 


57 


■aami8BaaBi .uu.Ki.t i iii ijjim — | 

■nETnflHHBHHHHBHIHB 


Logistic operation management: 


Asset tracking system: - 


14 0.12985 Inventory management: - 


Assembly plant management: - 


46 0. 07904 E-commerce: - 


Transportation management: - 


23 0.21735 Logistic management: - 


57 1 0.09568 1 Decision support system: - | 


1 6 0.0568 1 Manufacturing control facility: - 


16 0.55616 Tactical plannin 


0.875 see 


sc e 


sc e 


15 0.48281 see 


6| 0.36404 1 sc e 


3 0.70175 


7 


283 




Fleet management: - 


Logistic control: - 


Logistic operation management: - 


Inventory information system: - 


Asset tracking system: - 


Manufacturing planning and schedulin 


Available to promise: - 


Production schedulin 



















































































































No. of 
docuinen 
t(Having 
keyword 

KEYWORDS USED s) 


Total no. 
of 

documen 


MRP 


IP 


MRP 


RP 


Network 


Network 


Network 


Network 


Network 


Network 


Network 


Network 


Network 


Network 


Network 


Network 


Network 


Network 


Network 


Network 


Network 


Network 


Networks 


Node 


Operation 


Operation 


Operation 


Operation 


Operation 


Operation 


Operation 


Operation 


Operation 


Subclass ts 


ce 


21 E com 


5 E com 


40 see 


13 


3 sc e 


3 sc e 


3 


2 sc e 


3 


5 sc e 


6 


6 sc e 


3 Ass p mai 


I Weight Subclass 


57) 0.573621 Decision support system: • 


I Subclass 


283 0.25887 


74 0.7425 


16 


7 


283 0.54494 


Manufacturing planning and schedulin 


I f» V/3H H U U H « 


24 0.08847 Assembly plant management: ■ 


E-commerce: - 


4 0.26542 1 Miscellaneous: - 




6 0.4|sce 


14 0.22148 see 


4 0.77519 see 


sc e 


sc e 


sc e 


0.25 sc e 


Inventory control: - 


Assembly line management: - 


Assembly integration: - 


Enterprise security: - 


Electronic fund transfer: - 


Network based commerce: - 


I Supply chain network information syst 


283 

0.4417 

74 

0.56306 


0.5 E-commerce: ■ 


0.173 


E-commerce: - 


0.8 


283 0.44588 


74 0.55418 


533 


Enterprise and site plannin 


0. 1 655 1 Inventory management: - 


0.09655 Assembly plant management: - 


0.05037 E-commerce: - 


0. 1 2873 Transportation management: - 


0. 1 0075 Logistic management: - 


0.06775 Decision support system: - 


0. 1 0075 Information system: - 


20 

1 0.4447 1 1 Strategic plannin 

m 

8 

1 0.555891 Operational plannin 


sc e 


0.6666 sc e 


sc e 


sc e 


0.5 1 see 


0.42941 see 


| Assembly line management: - 


Electronic fund, transfer: - 


Fleet management: - 


Logistic operation management: - 


Freight distribution system: - 


| Manufacturing control station: - 



















































































































Product information 


Product information 


Product information 


Purchase 


Purchase 


Purchase 


Purchase 


Purchase 


Purchase 


Purchase 


Purchase 


Purchase 


Purchase order 


Purchase order 


Purchase order 


Purchase order 


Purchase order 


Purchase order 


Record in 


Recordin 


Record in 





57 

0.1357 Decision support system:- 

46 



2 inf sys 


10 see 


2 sc e 


2 sc e 


2 sc e 


3 sc e 




0.666 see 


6 0.28571 see 


283 


46 0.18872 E-commerce: - 


I Logistic operation management: ■ 


I Inventory information system; - 


57 0.1523 Decision support system: - 


46 0.28308 Information system: - 





























































































































RFID 


RFID 


RFID 


RFID 


RFID 


RFID 


Schedule 


Schedule 


Schedule 


Schedule 


Schedule 


Schedule 




57 

0.17026 Decision support system: - 

46 

0.16878 Information system: - 



8 

0.66308| Operational planning: - 


Assembly line management: - 


Manufacturing planning and schedulin 






























































































































Time period 


Time period 


Time period 


Tracking system 


2 inf sys 


2 


5 


8 see 


3 see 


2 sc e 


3 Ass p mai 


18 see 


5 


2 sc e 


3 see 


5 sc e 


2 see 


2 log man 


2 man con 1 


24 0.78 1 25 Assembly plant management: - 


57 0.2193 Decision support system: - 


14 1 sc e Assembly line management: - 


283 0.48516 


74 0.51539 


Assembly plant management: - 


0.26529 Logistic management: - ! 


0. 1 7841 Decision support system: - 




16 0.25424 Manufacturing control facility: - 


0.6666 1 sc e 


0.6666 sc e 


Logistic operation management: 


1 Asset tracking system: - 






































































































































































ANNEXURE F 

Assignee Name 

Assignee a nd S ubclass 

International Business Ma chi nes Co rporation (Armonk, NY) 

sc e(18) 

sc p(6) 

ass plant man 

dss 6 

e com 6 

inf sys 3 __ 

mis 2 


o p p l an 2 

st plan 2 

t at plan 2 

i2 Technolog i es US, Inc. (Da llas, TX) 

sc e(3) 

sc p(7 ) _ 

dss 2 

st p(6) 

tac p 

Accenture LLP (Pal o Alto, CA) 

sce(6) 

e-c(6) ___ 

i2 Technologies, Inc. (I rvi ng, TX) 

sc e 2 

sc p(6) 

ass p man 

dss 

e com __ _ 

stp 4 

tac p __ 

CIENA Corporatio n (Linthicu m, MD ) _ 

sc e 3 

mf c fs 3 

Infineon Technologi es AG (Muni ch, D E) 

s c e 2 

ass p man 2 


Weight 

~0.439620129 

0.560416651 

0.049455984 

0.124941434 

0.154818733 

0.077409367 

0.59347181 

0.526315789 

0.210526316 

0.263157895 

' 0.1 00863052 
0 .90004371 6 
_0. 666666667 
0.705882353 
0.294117647 





0.080308384 

0.921375921 

0.515039143 

0.216858587 

0.268716 075 

0,615384615 

0.384615385 


The Chase Manhattan Bank (New York, NY) 
sc e 2 


Koninklijke Philips Electronics N.V. (Eindhov en, NL ) 

sc e 2 


0.0802181 93 

’0.922509225 


inf sys 



As signe e and Subclas s 

C l ear With Co mp uters, Inc. ( Mankato, MN) 
sc e ( 2) 

dss(I) ~ 

infsys(l) 

Ford Motor Company (Dea rbo rn, M I) 

sc e(2) 

inf sy 

mf c fac_ _ 

Ro ck-Tenn Company (Norcros s, G A ) 

sc e2 

ass p man_2 _ 

Kabushiki Ka isha T oshiba (Kawasaki, JP) 

sc e 2 

as s pi m an 

dss_ _ ___ 

International Business Machines Corp. (Armonk, NY) 

sc e 2 _ 

dss 2 

Hitachi, Ltd. (T oky o, JP) 

see 5 

ass pi ma n 1 

dss 2 


Total No. 
2 



Weight 


0.4 46635938 

0.553440184 


0.342888493 

0.657202944 


0.703703704 

0.296296296 


0.542857143 

1 



Nam e 

Abrahamsson Steffen 
A dam s Brian C 
Alibrahim Hussa m 
Altman Arthur H. __ 
Arme ntrout Olin 
Bakkalbasi Qmer 
Bakkal basi Omer 
Bar gh Adrian Ne il 
Base man Ro bert 
Bastian, II William A. 
Bellini Joseph M. 


ANNEXURE G 

AUTHOR 'S N AME"" 

subclass Name 

sc exe Li Shuchen 


sc exe 
sc exe 
sc exe 


sc exe 
sc exe 


sc exe 
sc exe 
sc exe 


Bhaskaran Kumar 
Bibbee Jeffrey N. 

Bodin Willia m K ress 
Bourne Robe rt D . 
Bowma n-Amuah M ichel 

Brandt Gary 

Braun Heinri ch 

Brown Tim 

Burchett Christopher D . 
Bumard Mike Robert 

B urney Jessica 

Callen Kevin 

Caswell Robert L. 

Cat an Carolyn Ram sey 
Chan Lap Mu i Ann 

Chen Lawton 

Che meff Jonath an M. 
Chin Goodwin R. 

Ch isolm David A. 

Chisolm David A. 

Co nnors Daniel P. 

Crawfo rd, Jr. James M. 
Csipkes Andrei 
Cudahy Gregory C . ___ 
Curkendall Leland D. 
Cybulski Eric R. 
DAgostino Vincent 
DAmelio Vince 


sc exe 
sc exe 
sc exe_ 
s c exe 
sc exe_ 
sc exe 


sc exe 
sc ex e 
sc exe 
sc exe 
sc exe_ 
sc exe 
sc exe 


sc exe 
sc exe 


sc exe 


sc exe_ 
sc exe 
sc exe 
sc exe 
sc exe 
sc exe 


Lin Tao 

Lin Tiaohua 

Lu David Jun 

Madam Vijay Kumar 

Marshall James 

Matsubayashi Michinori 

Matsui Toshinari 

Mayer John E. 

Mayer John E. 

Mikur a k Michael G. 

Miller Jef frey 

M ilne R obert J . 

Mi yasaka Kazumi 

Momyer Douglas A. 

Moore Herbert J. 

Morenz Robert G. 

Morita Tom 

Morrison Matthew J. 
Natarajan Bharath 

Nayak Nitin 

Newberry Rande W. 
Nickey Carolyn M. 

Not ani RanjitN. 

Notani Ra njitN. 

Okayama Nobuya 
Okuyama Ka tsuo 

Old en Eric M. 

O'Leary Denis 
Orzell Robert A. 

Owen Stephen 

Pape Wil liam R. 

Parasnis Abhay V. 

Par asnis A b hay V. 

Pati Mahes h C. 

Pearson Douglas R. 
Perkowski Thomas J. 
Peterson Larry C. 

Poole Elizabeth Jodi 



subclass 


sc exe 


sc exe 
sc exe 
sc exe 
sc exe 
sc exe 
sc exe 


sc exe 
sc exe 
sc exe 
sc exe 
sc exe 
sc exe 
sc exe 
sc exe 
sc exe 


sc exe 


sc e xe 
sc exe 


sc exe 


sc exe 
sc exe 
sc exe 
sc exe 
sc exe 


sc exe 


1 sc exe 
[sc exe 
i sc exe 


Name 

Dangat Geet aram S. 

De betaz Weylin J. 

Pehn Francis D. 

DeMuro Richard Thomas 
Desiraju Ramakrishna 
Desiraju Ramakrishna 

Di ck s on Da vid P 

Diet rich, Jr. Walter C . 
D olan A ndrew J. 

D ragon Paul 

Dulane y Earl F. 

Ehrenleitne r Fran z 

Eiger Jurgen 

Eng stro m Harold H. 
Estrada Julio 

Estrad a Miguel 

Ettl Marku s 

Evet ts Gregory A. 

Evetts Gregory A. 

Fe dergrue n Awi 

Fox Frede ric D . 

Francis R ob ert C. 

Fu Bor -R uey 

Gigliotti Samuel Scott 

Ginter Karl L. 

Gokhale Anan d R. 

Goss Lois 

Grett v e Per 

Gross Wil fried 

Hartley-Urquhart William ! 

Haverstock Paul 

Hegde Sa njay R . _____ 
Hennig Carole 

Hilerio Israel 

Hilerio Israel 

Hodges Jan N. 

Flogerton Peter B. 

Horst Robert L. 

Howie George R. 

Huang Ying 

Igawa Kumiko 


subclass 
sc exe 
sc exe 
sc exe 
sc exe 


Name 

Prabhakaran Sanjiv . 

Price Eric 

Proudfoot Andrew H. 
Radican Joseph E. 


I sc exe [Ramaswamy Sanjay Elat 2 


sc exe 
sc exe 
sc exe 


sc exe 
sc exe 
sc ex e 
sc exe 
sc exe 
sc exe 
sc exe 
sc exe 


sc exe 


sc exe 


sc exe 
sc exe 
s c ex e 
scexe 
sc exe 
sc exe 
sc exe 
sc exe 
sc exe 
sc exe 
sc exe 


sc exe 
sc e xe 
sc exe 


i sc exe 


Ransford Mike J. 4 

Reid Robert L. 2 \ 

Riehl Juergen 2 

Ross G. Terry 2 

Routhier Edmond E. 2 

Sagar Ajit 2 

Sagar Ajit 2 

Sai to Hiroyuki 2 

Sasaki Katsunao 2 

Sasano Toshio 2 

Schwarten Dave A. 4 

Shah Bhaven S. 2 

Shall Bhaven S. 2 

Sharp Shawn T. 2 

She ar Victor H. 2 

Sheehan, Jr. Richard L. 2 

Shkedy Gary 2 

Sitarski Edward 2 

Sladek Marjorie 2 

Sl eep Nicholas J. 3 

Smith Michael 2 

Spahn Francis J. 2 

Srivastava Ashok N. 2 

Sud o u K ouj i__ 2 

Summers Gary J. 4 

Sw an Richard J. 4 

Tang Jung-Mu 2 

Taylor John Timothy 2 

T erabayashi Eiichi 2 

Terash ima Hiroyoshi 2 

Thomas D ennis R. 2 

thr ift Jo hn R. 2 

Tomforde Johann 2 

Tong Sai-Kit K. 2 

Transportation planning: o p 
Tyberghein Mike 4 


subclass 
sc exe 
sc exe 
sc exe 
sc exe 
sc exe 
sc exe 
sc exe 
sc exe 
sc exe 
sc exe 
sc exe 


sc exe 
sc exe 
sc exe 
sc exe 
sc exe 


sc exe 
sc exe 
sc exe 
sc exe 
sc exe 
sc exe 
sc ex e 
sc exe 


sc exe 



sc exe 
sc exe 
sc exe 
sc exe 
sc exe 
sc exe 
sc exe 
sc exe 
sc exe 
scjjxe 
sc exe 


sc exe 



Name 

Ishiwata Masa o 
J ohnson Ja mes M . 
Johnson J er ome D. 

Ka~J ~ 

Ka 

Kazem i Niakam 
Kennedy BrianM. 

Kil David 
Kinross RobertP. 
Kirkegaard Jo n R. 
K link Allan “ 
Koik e Hi roshi 
Krasjnski Ray 
Kroening JamesL. 
Kubo Shigeru 

Kumar Krishna 

Kunchithapatham Arun 
Kwiat kow ski Steven E. 
Landva ter D arryl V. 
Lawrence James _ 
Leung Ying Tat 



su bclass [N ame freq 

sc exe Uchibori Hidetoshi 2 

sc exe Underwood Roy Aaron 4 

sc exe Van der Veen W outer 4 

sc exe Van Wie David M L 2 _ 

sc p ! V enkatasubramanyan Na 2 


sc exe 


sc exe 


sc exe 


sc exe 
sc exe 


sc exe 


sc exe 
sc exe 


Vogler Hartmut K. 
Vrieling James 
Waller Matthew A. 


sc exe Weinand Hans- Joachim 2 


Weinstein Syd 

Weisser, Jr. P aul T. 
Whipple Mark B. 
White Jason S. 
Wilson James R. 
Wong Charles 
Yablonski Mark S. 
Ya mamoto Tsukasa 
Yao David Da-Wei 
Yen Chih-Kuan 
Yoshida Eichi 


subclass 
sc exe 
sc exe 
sc exe 
sc exe 


sc exe 
sc exe 
sc exe 
sc exe 



sc exe Zimmerman Thomas Gu 2 


sc exe 
sc exe 
sc exe 
sc exe 
sc exe 
sc exe 
sc exe 
sc exe 
sc exe 



I-Class Sub< 

gQ6f 0 07/00 Isc7 

gQ 6f 015/ 24 g08b 005/22^ J sc J 

a6 lb 005/00 dss 

a6ib 005/ Q^ 6?m97Q0"7~ see 

a61b 005/00 g0 6f 019/00 infs 

a61 n 0 01/00 __ mis 

a61 n 005/04 see 

a61n 005/04 inf s 

a63f 009/24 "infs 

b07c 005/00 man 

b2 3 b 01 3/ 04b23b 003/36 b2: see 
b23b 013/04 b23b 003/36 b2! sc e 
b23p 02 f /0Q “ 7' Ass 

b32b 031/00 b65c 01 1/04 Ass" 

b601 001/00 Ass 

b60jp 003/42 b60p 003/08 Trl 


ANNEXURE H 

I-Clas s Weig ht 

[Sub class Subclass I 


su bclass Subc lass ^Subclass 

sc e In ventory managem ent: - 

see Inventory mana gement: - 

sc e [Man agement training system: • 

see Decision support syste m: - 

inf s y s see Supply chain network informa 

mis sc e Sup ply chain fi nan cial model: 

sc e D ecision support system:_- 

s ys se e Supply c hain network informa 

int sys see Freight distribution system: - 

m an con fac se e Manufacturing control station: 

sc_e Decision support system 

see Information system: - 1 


Ass p man see 
Ass p man sc e 
Ass p m an sc_e 
Tr Man sc e 


^0e003^ Wr,^ 08 inf sys 


b60p00 7/ 08 


As s p m an see 


b 60t 00 7/1 6 g0 6f 017 /00 log man sc e 

b62d 065/00 man con fac sc e 

b63b 035/30 _ Tr Man see 

b63b 06 7/60 _ Tr Man see 

b 65b 051/3 0 b65b Q09/06b6: As s p man sc e 


b65g 001/00_ 

b65gl)21/22_ 

b65g 029/00 ‘ 

c07h 021/04 ^ 


0 01k 0 13/02 

f!6g 013/00 1, 

fl6 g"013 /T6 

gOl n 037/00 gQ 6f 019/00" 
g0In 037/00 g06f 019/66" 


Ass p man sc e 

Tr Man sc e 

Tr Man sc e 

Tr Man sc e 


Assjj man sc e 

dss_ sc e 

E com sc e 


__ Assembly line management: - 

Assembly system management 

Assembly integr ation : - 

Fl eet management: - 

Freight distribution system: - 

Assembly integration: - 

Logistic operation managemen 

Manufacturing Gontrol station : 

Fleet management: - 

Fleet management: - 

Assembly line management: - ' 

Assembly line management: - 

Fleet management: - 

Fleet management: - 

Fleet management: - 


Assembly line management: - 

Manufacturing logistic decisioi 

Network based commerce! - 


Weight 

1 

I 

1 

05 

05 

1 

\ 

11 

i 

1 

0.44664 

0.55344 

1 

1 

1 

1 


gO 1 r 023/02 gO 1 r 0 1 3/ 1 4 see _ De cision sup 

gOlr 023 /02 gO 1 r 0 1 3/ i 4 _ i nf sys se e 

gOl r 03 1/02 Assjp man _ set 

gOls 003/02 gO ls 013/08 g08 l og man see 

gO 1 s 003/0 2 g 0i s 0i 3/08 g08 inf sys sc e 

gOls 003/80 _ _ see Decisi on sup 

gOls 003/80 __ _ l og ma n see 

gOis 003/80 inf sys see 

gOJ v 003/08 __ man con fac sc e_ 

g05b 017/00 _ dss' _ __ se e 

g 05b 0 17/00 gQ 6f 017/6Q" d ss" ~ see 


s ee q uality contr ol 

Tr Man sc e Fleet management: - 

sc e Dec ision support s ystem: - • 


inf sys see __ Inventory information system: 

Assjp man sc e Assembly system management 

logman sce__ Logistic operation managemen 

inf sys sc e [Asset tracking system: - 

sc e Decisi on su p port system: - 


0.66674 

0.33337 

0.12099 


g05d 0 01 /0Q gOlc 021/ 00 g Oc log m an see 


g06f 003/00 t p 

g06f 003/14 g06f 019/00 mi 

g06f 007/00 _ _ As 

g06f 007/00*’ ’ ‘ ~~ ~ As 

g06f 007/00 lojj 

g06f 007/00 ds 

g06f 007/66 ~ ~ds 

g06f 007/66 g06g 661/14 sc 

g06f 007/00 g06g 001/14 sc 

g06f 007/00 g06g 001/14 sc 

g06f 009/00 tp 

' 06 f 009/445 E 


t p sc p 

man con fac sc e 
Ass p man s e e 
_ Ass p man _ sce__ 
l og man sc e 


Logistic operation managemen 0.29985 

Asset tracking system: - 0.14993 

Manufacturing monitoring: - 0.43 1 03 

Management training system: ■ 1 

Management training system: - : 

Logistic control: - 1 

Supply chain planning: - 
Manufacturing control station: 1 


r system manag 


Assembly line management: - 
L ogistic control: - 


0.1028 

0,6168 3 

0.10727 


Manufacturing logistic decisioj 0.08657 


see __ _ [M anagement training system: ■ 

De cision su pport systems 

In formation syst em: - ... ... 

information system: _ ; ; - . 

sc p Manufa cturing planning a nd s< 

see Network based commerce: - 


0.08657 

0.16788 

0.41606 

0.41606 



ANNEXURE I 

_ U-class 

_ U-cl ass Subclass Subclass 

235/375 235/462,01 235/462.25 1 'see' svstenv- 

235/375 235/462.01 235/487 see 

" 235/375 235/462.01 235/487 " ] 

235/380 235/375 235/379 902/26 E_c5m 7c“e 

235/383 235/385 235/375 705/18 705/22 see Logistic ma n aPft m P ,f“ 

235/ 3 83 235/385 235/375 7 05/18 705/22 inv man sc e 

235/383 235/385 235/375 705~/18 705/22 " Ecom sc7~“ 

'235/384 235/492 "~'T?M an 7c~e 

235/385 see Logistic managemen t: - 

235/385 _ . ___ sc e Decision support system: - 

235/385 sc e Decision support system: - 

235/385 see Informa tion system: - 

235/385 _ s ee Info rmation system: - 

23 5/385 _ _ log man sc e 

235/385 235/383 235/384 sc e _ Decision support s ystem: - 

235/385 235/383 235/384 sc e information system; - 

235/385 235/383 235/384 ™ logm an see ' 

235/385 340/10.34 _ _ see __ Invento ry manage men t: - 

23 5/462 .0 1 235/380" 2 35/382 ' scT ' ~ E-conunerce: - 

235/ 4 6 2.6 1 2 35/380 2 3 5/38 2 s c e Infor mati on sy stem: - 

235/472.02 235/375 235/462.45 sc e Information system: - 

235 /486 235/492 _____ see Logistic management: - 

235/4 86 235/492 ' sc e D ecision support system: - 

235/486 235/492 __ __ sc_e Infor mation system: - 

" 235/486 235/492 J __ "logman see 

235/9 lr 73/828 324/206 7 “ ’ man con fac sc e 

2 3 q/ 7 9 2 28076 SIT 1 0 /66 ’ Asspman see 

""* 29/714 29 /783 297786 29/793 J 98/346.2 198/465 Ass p man see 

"’ 29/77 1 29/78 3 29/791 29/822 52 ^752/79 752/2 see_7 Assemb ly plant management:- 

7_ 29/783 29/281,5 29/714 29/ 791 29/795 228/4.1 2 man con fac sc e 

3 24/1 58.T"324/7 52”3 24/753 __ Ass pman see 

324/76.39 32 4/76 ,24 __ __ _ sc e Decision support system: - 

‘340/192 340/579 340/825752 705/28 705/29 scT __ J I nventory management: - 

7. 34 Q/ 5 72 t 1 *340/5 ,92 340/1 0 340/505 i 700/2 1 5 7( sc e _ Logi stic management: - 

140/5711 140/192 340/10. 1 346/565 766/215 Ti'sce ” Decision support system: - 


Logistic management: 


[Logistic management: ■ 


Decision support system: - 

_ Infonnation system: - 

. Info rmation system: - 

_ sc e 

_ Decision support system: - 
In formation system: - 

see 

_ In ventory manageme n t: - 

„ E-co m merce: - 

_ Information sys tem: - 

Infonnation system: - 
_ Logistic managem e nt: - 
_ Decision support system: - 
Information system: - 


Subclass 


Electronic fund transfer: - 

Inventory control: - 

Online shopping: - 

Fleet management: - 


[Logistic operation management: 


Logistic o p eration management: - 


Logistic operation management: 
Manufacturing monitoring: - 


Assembly line management; - 

Manufacturing control station: - 
Assembly system management: - 


340/573,3 119/51 .02 _ see Logistic management: - 

340/573.3 119/51.0 2 see Decision support sys ten 

“34 0/573,3 119/ 51.02 __ see Infonnation system: - 

*340/98 8J540/43 8340/995J 9 ’ ' JlrMan see 

”342/457 23S/384346/5!42 * 340/1 671 340/1 6.5~34( infsys see , . 

"342/458 342/4^340/51 1~ 340/572.4 340/686^""' sc e Logi stic ' management: - 


Logi stic management: - 

340/572.1 J40/5.92 340/10.1 340/505 700/215 7( |s ce _ I Decisi on support system: - 
" 340/572.1 3AOf53TT4SnoTiW505 706 / 2 T 5 Wsc e "'“information system: - 


Logistic management: - 

Decision support system: - 
Infonnation system: - 


342/458 342/47 340/51 LJ40/572.4 340/686.6_ sc e 

“~3 42/4 5 8 342/4714 0/511 140/57274 3407686.6 " see ” 

342/458 342/47 340/51 1 1 40/572.4 340/686.6 logman 
345/419" 345/782 34 5/848 705/7 705/8 705/28 70 E com 
345/4 1 9 345 /7 82 345/848 7 057f 705/8 705/28 T6 Tog man 
345/4 19 345/782 3 45/848 70 5/ ?' 705/8’ 7 05/28 70 dss ~ 

34574 40 " "" ‘"sc e _ " 

345/705 3 ^9611 4 5/^ man con' fid sc e 

345/733 703/21 Asspjnan sc e 

" 345/7 33 7 03/2 f 7. “* 7~ J. E com _ see 

345/733 7 03/21 " ~ ~ s"f |sc p 

367/1 1 8 367/12 8 "1 "see" _ 

3677118 367/128" ~~ ““ see" 

367/118 367/128 * logman 

370/254 370/385 370/386 ' 370/400 379/221.05 s p 
379/221.13 345/700 379/221.14 see 


Decision support system; - 
Infonnation system: - 


379/22 U 3 345/700 379/221,14 infsys see 

379/265 02 379/266.08 ' " E com _ sc e 

^40/W4p5p40/81 5&Z, . 7™ I?li HL. 

'414/800 414/812 [ 3 Tr Man see 

41 4/800 4 1 4/8 12 " "~77 „ 

414/803 212/270 212/344 414/140.3 * Tr Man see 

463/42 infsys see 

53/550 53/20 1 53/374.4 1 56/582 1 56/583. 1 Ass p man sc e 

536/23.53 435/69.6 435/320. 1 435/328 TrMan see 

59/78. 1 59/900 248/49 E com ;sce„ 

60/660 60/6527667291 Asspman see 


_ sc p 

_ Decision su pport system: - 
Infonnation system: - 
sc e 
sc p 

Decision support system: - 


Fleet management: - 

F reight distribution system: - 


Logistic operation management: - 

E-commerce system: - 

Logistic control: - 

Management training system: - 

Manufacturing control station: - 
Assembly line management: - 
N etwork based commerce: - 
Enterprise and site planning; - 


Logistic operation management: - 
Enterprise and site planning: - 

__ Supply chain network information i 

Electronic fund transfer: - 

Management training system: - 

Fleet management: - 

Freight distribution system; - 

Fleet management: - 

Freight distribution system: - 
As sembly line m ana gement; - 
Fleet management: - __ 

_ N etwork based co m merce: - 
Assembly line management; - 


Weight 

__ i 

" 0.500901623 
' 0,500901623 

1 ^ 

0.318288879 

0,522903158 

0.159144439 

l 

0.178189594 

0.143802128 

0.143802128 

0.178189594 

0.178189594 

0.178189594 

0.212138569 

0.262867357 

0.52573471 4 

1 

0.500095018 

0.500095018 

1 _ 

0.525734714 
~ 0.212138569 
0,262867357 

1 _ 

l 

1 

r 

]_ 

l 

1 ^ 

\ 

1 

7 0.525734714 
0,212138569 
0.262867357 


0,525734714 
0.212138569 
0.262867357 
I 

77 j 

6.525734714 

0.212138569 

0,262867357 

77 L 

0.525734714 

0.212138569 

0.262867357 

I 

1 


0.525734714 

0.212138569 

0.262867357 

1 

0.446635938 

0.553440184 


0. 7936 50794 

0,207039337 

I 

7 ■ ” 1 





ANNEXURE J 





Attribute 
No. of 

Keyword pair and Ratio 3 Calculation 

i r - • i 

- - ~ 


KEYWORD PAIRS USED 

Document) 

Having 

Keyword 

)pairs 

Subclass 

Weight 

Subclass 

Subclass 

j 


bro.invento 


bro, network 


bro, operation 


bro, schedule 


bro, warehouse 


computer Implemented .data 


computer Implemented, eas 


computer Implemented, enterprise 


computer Implemented, Information 


computer implemented, mr 


computer implemented, network 


computer implemented, node 


computer implemented, schedule 


omputer implemented, supply chain 


computer implemented, time period 


consumer, container 


consumer, demand 


consumer, displa 


consumer, eas 


consumer, inventor 


consumer.ipa 


consumer, network 


consumer, networks 


Assembly line management: - 


Assembly line management: - 


Web commerce: - 


Assembly line management: - 


Assembly line management: - 


Assembly line management: - 


Manufacturing planning and schedulin 


Manufacturing planning and schedulin 




Logistic operation management: - 


Assembly line management: - 


Electronic fund transfer: - 


Assembly line management: - 


Assembly line management: * 


Assembly line management: - 


Electronic fund transfer: - 


Electronic fund transfer: - 


Electronic fund transfer: - 


consumer, performance 


consumer, purchase 


consumer, shi 


consumer, shipment 


container, customer 


container, data 


3*1 Evl 


container, eas 


container, information 


container, mvento 


container, location 


container, operation 


container, packagin 


container, performance 


container, shi 


container, shipment 

23 

container, supply chain 

117 

customer, data 

8 

customer, data 

14 

customer, data 

i 




customer, eas 


customer, eas 


customer, forecast 


customer, information 


customer, Information 


customer, invento 


customer.location 


customer, operation 


customer, operation 


customer, packaain 


0.113225 

0.434028 


0.452899 


1 


0.657203 


0.342888 


1 


0.793651 

0,207039 

1 

1 


1 


1 


1 


Assembly line management: - 


Electronic fund transfer: - 


Assembly line management: - 


Assembly line management: - 


its— 


Assembly line 


Assembly line 


Assembly line 


Assembly line 


Assembly line 


Assembly line 


Assembly line 


management: 


management: 


management: 


management: 


management: 


management: 


management: 




Assembly line management: - 


Assembly line management: - 


Assembly line management: - 


Assembly line management: - 


Assembly line management: - 


Inventory control: - 

Assembly line management: - 


Logistic operation management: 


Assembly line management: - 


Assembly line management: - 


Logistic operation management: 


Available to promise: - 


KH WESSMEEESm 


Logistic operation management: 


Assembly line management: - 


Assembly line management: - 


Assembly line management: - 


Assembly line management: - 


Assembly line management: - 



























































































































































































Weight Subclass 
1 sc e 



Subclass 

Assembly line management: - 
Assembly line management: - 
Assembly line management; - 
Assembly line management: - 
Assembly line management: - 
Logistic operation management: - 


Enterprise and site plannin 


Fleet management: - 


Management training system: - 


Manufacturing control station: - 


Inventory control: - 


Assembly line management: - 




Fleet management: - 


Logistic operation management: - 


Management training system: - 


Supply chain network information system: 


Manufacturing control station: - 






0.203423 


0.259929 


0.406846 


0.165267 


0.553287 


0.050299 


0.231374 


0.179662 


0.826446 


0.253807 


0.088281 


0.035622 


0.126904 


0.507614 


1 


0.627117 


0.365818 


1 


1 


0.400256 


0.600384 


Invento 


Fleet management: - 


Assembly line management: - 


Electronic fund transfer: - 


Fleet management: - 


Logistic operation management: - 


Management training system: - 


Supply chain network information system: 


Supply chain financial model: - 


Enterprise and site plannin 


Assembly line management: - 


Electronic fund transfer: - 


Assembly line management: - 


Electronic fund transfer: - 


Fleet management: - 


Logistic operation management: - 


Inventory control: - 


Web commerce: - 


ISupply chain network information system: 



















































































































































































































KEYWORD PAIRS USED 
data, supply chain 


NSTo? — 
Document) 
Having 
Keyword 
)pairs 
22 


Subclass 
E com 


data.supply chain 

104 

data.supply chain 

24 


data, supply chain 


data.supply chain 


data, time period 


data, tracking system 


data.transportation 


data.vehicle 


data, vehicle 


data.warehouse 


data, web browser 


demand, eas 


demand, enterprise 


demand. forecast 


demand, information 


demand, inventor 


demand, ipa 


demand, network 


demand, networks 


demand, plannin 


demand, purchase 


demand.supply chain 


display, eas 

display, eas 


display, eas 


displayjnformation 


display.information 


S raEHiiimsiiiBm 

mm mg m 


display, network 


display, operation 




display, performance 


display, performance 


display, purchase 


display, shi 


display, shipment 


display, supply chain 


display, vehicle 


eas, enterprise 


eas, information 


eas, information 


eas, information 


eas.information 


eas, information 


eas, inventor 


eas.mvento 


easJnventor 


eas.invento 


a 


eas, ipa 


eas, location 


eas.location 


eas, location 



Weight 

0.065155 


0.195466 


0.065155 


0.374644 


0.299715 


1 


1 


1 


0.74206 


0.258108 


1 


1 


1 


1 


1 


1 


1 


1 


1 


1 


1 


1 


1 


0.186846 


0.097485 


0.157344 


0.560538 


0.115634 


0.886525 


0.657203 


0.342888 


1 


1 


1 


0.400256 


0.600384 


1 


1 


1 


1 


Subclass 
sc e 


Subclass 

Network based commerce: - 


Logistic operation management: - 


Supply chain network information system: 


Supply chain financial model: - 




Logistic operation management: ■ 


Logistic operation management: • 


Fleet management: - 


Fleet management: - 


Logistic operation management: 


Assembly line management: - 


Web commerce: - 


Inventory information system: - 




xmsmEam i m 


Inventory information system: - 


Electronic fund transfer: - 


Electronic fund transfer: - 


Electronic fund transfer: - 


Management training system: - 


Manufacturing control station: - 


Fleet management: - 


Assembly line management: - 


Manufacturing control station: 


Assembly line management: - 


Assembly line management: - 


Manufacturing control station: 


Online shoppin 


Assembly line management: - 


Assembly line management: ■ 


Assembly line management: * 


Fleet management: - 


i— QagnEgffiElEig 


Fleet management: - 


Electronic fund transfer: - 


Logistic operation management: 


Management training system: - 




Inventory control: - 


Assembly line management: - 


Online shoppin 


Inventory information system: - 


Electronic fund transfer: - 


Management training system: - 


Assembly line management: - 


Electronic fund transfer: - 


Logistic operation management: - 


Manufacturing planning and schedulin 
Inventory control: - 


































































































































































































































KEYWORD PAIRS USED 

~ eas,network 

eas.network 


eas.network 


eas, network 


eas, node 


eas, operation 


eas, operation 


eas, operation 


eas.packagln 


eas, performance 
eas, performance 


eas, performance 


eas, purchase 


eas, schedule 


eas.shi 


eas, shipment 


eas, simulation 


eas, supply chain 

e 


eas, tracking system 


eas, transportation 


eas, vehicle 


eas, vehicle 


eas, warehouse 


electronic commerce, information 


electronic commerce, ipa 


electronic commerce, node 


enterprise, information 


enterprise, network 


enterprise, node 


enterprise, planning engine 


enterprise, supply chain 


firewall, network 


forecast, forecastln 


ui, information 


ui, vehicle 


information, invento 


information.lpa 


information, ipa 


information, location 


information, location 


information.location 


information. network 


inform ation.network 


information, network 


information, network 


information, node 


information, node 


information, operation 


information, operation 


information.operation 


information, packagin 




information, purchase 
information^ rfid 


o. of 
Document( 
Having 
Keyword 
)palrs 

17 

86 


90 


Subclass 
E com 


dss 


inf svs 



E com 


Tr Man 




Ass p man 


inv man 


JA/eight Subclass 
1 0.187676 see 


0.050486 


0.062559 


0.28777 


0.126604 


0.873566 


0.207039 


information, schedule 

25 

information, simulation 

42 

information, supply chain 

81 



Subclass 

Electronic fund transfer - 


Management training system: - 


Supply chain network information system: 


Enterprise and site planning: - 

I Supply chain network information system: 


Electronic fund transfer: - 


Fleet management: - 


Manufacturing control station: - 


Assembly line management: - 


Inventory control: - 


Assembly line management: - 


Manufacturing control station: - 


Assembly line management: - 


Assembly line management: - 


Assembly line management: - 


Management training system: - 


Assembly line management: - 


E»j33PlSESIiE3S2ffi 


Logistic operation management: 


Fleet management: - 


Fleet management: - 


Logistic operation management: 


Assembly line management: - 


Electronic fund transfer: - 


Electronic fund transfer: - 


Electronic fund transfer: - 


Enterprise and site plannin 


E 


E 


E 


E 


mmmmasm 

BMBU 



Fleet management: - 


Fleet management: - 


Electronic fund transfer: - 


Management training system: - 


Electronic fund transfer: - 


Fleet management: - 


Logistic operation management: - 


Management training system: - 


Supply chain network information system: 


0.045252 




Electronic fund transfer: - 


Supply chain network information system: 


Assembly line management: - 


Electronic fund transfer: - 


Supply chain financial model: - 


Assembly line management: - 


Enterprise and site plannin 


Logistic operation management: * 


Assembly line management: - 


Management training system: - 


Network based commerce: - 













































































































































































































KEYWORD PAIRS USED 

No. of 
Document( 
Having 
Keyword 
)pair$ 

! Subclass 

Weight 

Subclass 

Subclass 

1 information, supply chain 

9 

1 iog man 

0.181009 

sc e 

Logistic operation management: - 


information, supply chain 


information, supply chain 


information, supply chain 


information, time period 


information, transportation 


information,vehicle 


information, vehicle 


Inventory, network 


inventory, network 


inventory, operation 


inventory, performance 


invento 


inventory, purchase 


inventory, schedule 


inventory, shipment 


inventory, supply chain 


inventory .warehouse 


ipa, network 


ipa, network 


ipa, networks 


ipa, node 


ipa, operation 


ipa. purchase 


ipa, simulation 


location, network 


location, operation 
location »rfid 


location, schedule 


location, supply chain 


location.supply chain 


73 


45 


55 


67 


28 


6 


98 


13 


13 


140 


28 


50 


36 


107 


120 


91 


91 


2 


114 


162 


5 


162 


32 


125 



Tr Man 


Tr Man 


inv man 


E com 




Ass p man 


E com 


dss 


E com 


E com 


E com 


E com 


dss 


E com 


E com 


location, tracking system 

107 

location, transportation 

45 

location, vehicle 

fl 

location.vehicle 

2 

mrp, schedule 

15 

network, networks 

0 


network, networks 


network, node 


network, node 


network, performance 


network, purchase 


network, simulation 


network, supply chain 


network, web browser 


networks, purchase 


node, supply chain 


operation, packagln 


operation, performance 


operation, schedule 



0.045252 


0.5204 


0.20816 


1 


1 


0.657203 


0.342888 


0.765456 


0.232965 


1 


0.368438 


1 


1 


1 


1 


1 


1 


0.55344 


0.446636 


1 


1 


1 


1 


1 


1 


1 


1 


1 


0.489367 


0.510644 


1 


1 


0.389955 


0.610364 


1 


1 


1 


0.303196 




Supply chain network information system: 
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ANNEXURE K 

Value of each function over 25 documnt of SCP 



Patent Document? 

SP 

rp < 

DP 

Manual Reading 

Status 

Fun 1 

1.xml.txt 

1.294438 

0.901944 

0.651944 

5p 

wrong 

Fun 2 

1.xml.txt 

0.565015 

6.413782 

0.263782 ' 

rrelevent 

correct 

Fun 3 

1.xml.txt 

0.3 

0.2 

0.1 

sp 

wrong 

Fun 1 

10.xml.txt 

1.621056 

0.258127 

0.008127 

sp 

correct 

Fun 2 

10.xml.txt 

0.92777 

0.04175 

0 

sp 

correct 

Fun 3 

10.xml.txt 

0.8 

0.5 

0 

sp 

correct 

Fun 1 

1 1 .xml.txt 

2.411038 

0.724538 

0.474538 

sp 

correct 

Fun 2 

1 1.xml.txt 

1.271494 

0.211066 

0.061066 

sp 

correct 

Fun 3 

11.xmi.txt 

0.8 

0.4 

0 

sp 

correct 

Fun 1 

12.xml.txt 

1.957456 

1.795822 

1.545822 

sp 

wrong 

Fun 2 

12.xml.txt 

0.33457 

0.442921 

0.18457 

irrelevent 

correct 

Fun 3 

12.xml.txt 

1.3 

0.7 

0.6 

sp 

wrong 

Fun 1 

1287.xml.txt 

2.424323 

2.51222 

2.174323 

sp 

wrong 

Fun 2 

1287.xmi.txt 

1.257587 

1.221269 

1.071269 

sp 

wrong 

Fun 3 

1287.xml.txt 

0.5 

0.3 

0.2 

sp 

wrong 

Fun 1 

1293.xml.txt 

0 

0.005506 

0 

tp 

correct 

Fun 2 

1293.xml.txt 

0 

1.78E-05 

0 

tp 

correct 

Fun 3 

1293.xml.txt 

0.2 

0.1 

0 

sp 

wrong 

Fun 1 

13.xml.txt 

0.536467 

0.39244 

0.14244 

irrelevent 

correct 

Fun 2 

13.xml.txt 

0.139252 

0.073112 

0 

irrelevent 

correct 

Fun 3 

13.xml.txt 

6.9 

1.9 

14.6 

op 

wrong 

Fun 1 

14.xml.txt 

0.166765 

1.514856 

-0.083235 

tp 

correct 

Fun 2 

14.xml.txt 

0.628225 

0.015 

0 

sp 

wrong 

Fun 3 

14.xml.txt 

0.4 

0.6 

0.2 

tp 

correct 

Fun 1 

15.xml.txt 

1.50442 

1.204087 

0.954087 

sp 

correct 

Fun 2 

15.xml.txt 

0.555915 

0.563785 

0.405915 

tp 

wrong 

Fun 3 

15.xml.txt 

0.8 

0.5 

0.2 

sp 

correct 

Fun 1 

16.xml.txt 

0.428549 

0.948108 

1.2287 

op 

correct 

Fun 2 

16.xml.txt 

0.160816 

0.403741 

0.5098 

op 

correct 

Fun 3 

16.xml.txt 

0.15 

0.3 

0.5 

op 

correct 

Fun 1 

18.xml.txt 

0.656011 

0.260597 

0 

sp 

correct 

Fun 2 

18.xml.txt 

0.643186 

0.53398 

0 

sp 

correct 

Fun 3 

18^xml.txt 

3.1 

2.5 

0 

sp 

correct 

Fun 1 

19.xml.txt 

0.808724 

2.514805 

0.558724 

tp 

correct 

Fun 2 

19.xml.txt 

0.640076 

0.95375 

0.490076 

tp 

correct 

Fun 3 

19.xml.txt 

0.7 

0.7 

0.2 

sp/tp 

classify 

Fun 1 

19-8.xml.txt 

1.755877 

0.914551 

0.664551 

' sp 

correct 

Fun 2 

19-8.xml.txt 

0.900421 

0.269117 

0.119117 

sp 

correct 

Fun 3 

19-8.xml.txt 

0.6 

0.35 

0.3 

sp 

correct 

Fun 1 

2.xml.txt 

1.970061 

0.567839 

i 0.317839 

1 sp 

correct 

Fun 2 

2.xml.txt 

1.02317 

0.125294 

0 

1 sp 

correct 

Fun 3 

2.xml.txt 

1.7 

0.2 

: 0.3 

; sp 

correct 

Fun 1 

3.xml.txt 

1.34241 

0.712798 

1 1.587 

op 

correct 

Fun 2 

3.xml.txt 

0,182716 

i 0.300372 

1 0.2576 

i tp 

wrong 

Fun 3 

3.xml.txt 

0.1 

0.1 

0 

i sp/tp 

wrong 

Fun 1 

4.xml.txt 

1.647881 

0.359378 

i 0.2541 

tp 

correct 

Fun 2 

4.xml.txt 

0.94917S 

I 0.063572 

! 0.5564 

■ sp 

iwrong 

Fun 3 

4.xml.txt 

3.1 

0.2 

! 0.6 

i sp 

Iwrong 




Patent Document* SP 

TP 

OP Manual Reading 

Status 

Fun 1 

5.xml.txt 

2.346531 

0.641733 

0.4567 sp 

correct 

Fun 2 

5.xml.txt 

1.493742 

0.134683 

0.0234 sp 

correct 

Fun 3 

5.xml.txt 

2.1 

0.3 

0.9 sp 

correct 

Fun 1 

6.xml.txt 

3.577963 

0.655524 

0.45567 sp 

correct 

Fun 2 

6.xml.txt 

2.032603 

0.159799 

0.2345 sp 

correct 

Fun 3 

6.xml.txt 

0.94105 

0.17 

0.08 sp 

correct 

Fun 1 

7.xml.txt 

2.20796 

0.613392 

0.1575 sp 

wrong 

Fun 2 

7.xml.txt 

0.452413 

0.133654 

0 irr 

correct 

Fun 3 

7.xml.txt 

3 

0 

6.5 sp 

wrong 

Fun 1 

7-14.xml.txt 

1.647881 

0.359375 

1.15 sp 

wrong 

Fun 2 

7-14.xml.txt 

0.949179 

1.063572 

0.0987 tp 

correct 

Fun 3 

Fun 1 

7.14.xml.txt 

7-21.xml.txt 

3 

1.34241 

0.2 

0.712798 

0.6 sp 

0.325 sp 

wrong 

wrong 

Fun 2 

7-21.xmi.txt 

0.882716 

0 300372 

0.452 sp 

wrong 

Fun 3 

7.21.xml.txt 

1 

0.1 

0 sp 

wrong 

Fun 1 

7-22.xml.txt 

1. 970061 

2.567839 

1.254 tp 

correct 

Fun 2 

7-22.xml.txt 

1.22317 

1.125294 

0.5673 sp 

wrong 

Fun 3 

7.22.xml.txt ' 

6.8 

~~ 5.2 

0.3 sp 

wrong 

Fun 1 

8.xml.txt 

1.487447 

0.727884 

1.645 op 

correct 

Fun 2 

8.xml.txt 

0.627025 

0.288025 

0.8275 op 

correct 

Fun 3 

8.xml.txt 

0.5 

~~ 0 6 

0.3 sp 

wrong 

Fun 1 

9.xml.txt 

1.482825 

0.440131 

0.325 sp 

wrong 

Fun 2 

9.xml.txt 

1.532815 

0.154605 

sp 

wrong 

Fun 3 

9.xml.txt 

1.1 

0.4 

1 0.6 sp 

wrong 


